How to deploy an AI support agent that deflects 60–80% of tickets, integrates with your helpdesk, and escalates to humans when it matters.
H
Hestur AI
hestur.co
60–80%
Ticket Deflection
resolved without human agent
<30 sec
First Response
vs 4–8 hr industry avg
100%
24/7 Coverage
no off-hours queue
4.1+/5
CSAT Target
for AI-handled tickets
What "60–80% deflection" actually means: Your bot handles FAQ, order status, password resets, policy questions, and simple troubleshooting. Humans handle escalations, complaints requiring judgment, and complex multi-step issues. This is not about replacing your team — it's about freeing them for the 20–40% of tickets that actually need them.
Step 1 — Audit Your Current Tickets
Before building anything, pull 3 months of ticket data and categorise by type:
Category
Typical %
Bot Can Handle?
Order status / tracking
20–30%
Yes — with API integration
FAQ / product questions
15–25%
Yes — with knowledge base
Password / account reset
10–15%
Yes — with auth API
Returns / refunds
10–20%
Partial — policy Q yes, processing needs human
Complaints / escalations
5–15%
No — route to human immediately
Complex troubleshooting
10–20%
Partial — first 2 troubleshooting steps
If your top 3 categories account for 50%+ of tickets, you have an excellent bot candidate. If your tickets are mostly unique, complex problems — a bot will frustrate customers.
Step 2 — Build Your Knowledge Base
The knowledge base is the foundation of ticket deflection. Quality here directly determines bot accuracy.
1
Export your top 50 questions
Pull from helpdesk analytics (Zendesk, Intercom, Freshdesk all have this). These become your first KB articles.
2
Write structured answers
Each answer: max 150 words, simple language (no jargon), one clear action per step. Avoid answers that start with "It depends" — the bot will hedge and frustrate users.
3
Add structured data sources
Connect live data the bot can query: order status API (what's my order?), account lookup (subscription status, billing date), product catalogue (specs, availability).
4
Create escalation triggers
Define exact phrases and keywords that immediately route to human: "lawyer", "refund", "broken", "never works", "cancel subscription", "billing error". No bot attempts, straight to human.
Step 3 — Choose Your Deployment Surface
Surface
Best For
Integration
Website chat widget
SaaS, e-commerce, any web-first business
Intercom, Freshchat, or custom widget
Email auto-response
High email ticket volume
Email catch-all → AI → reply or route
WhatsApp / SMS
Mobile-first users, international customers
WhatsApp Business API + n8n
In-app chat
SaaS products with logged-in users
Intercom or custom SDK
Voice (phone)
Businesses with high call volume
Vapi + knowledge base RAG
Step 4 — RAG Architecture for Accurate Answers
For a bot that answers from your specific knowledge base (not the LLM's general knowledge), you need Retrieval-Augmented Generation:
1
Chunk and embed your KB
Split each KB article into 300–500 token chunks. Embed with OpenAI text-embedding-3-small. Store in Pinecone or Weaviate. Cost: ~$1 per 1M tokens.
2
Retrieve on each query
When a customer asks a question, embed it and find the top-3 most similar KB chunks. Pass those chunks + the question to the LLM.
3
Prompt the LLM with context
"Answer the customer's question using only the provided context. If the answer is not in the context, say 'I don't have that information — let me connect you with our team' and escalate."
4
Log and improve
Track queries that return "I don't have that information." Add those answers to your KB weekly. Accuracy typically improves 15–20% in the first month.
Step 5 — Helpdesk Integration
✓For Zendesk: use the Zendesk API to create tickets for escalated queries, preserve full conversation history
✓For Intercom: bot lives inside Intercom as a custom bot — seamless handoff to inbox
✓For Freshdesk: webhook on escalation → create ticket with transcript + sentiment label
✓Always include conversation transcript in the ticket — humans should not have to ask again
✓Tag bot-handled tickets as "Bot resolved" — track weekly deflection rate from these tags
Step 6 — Escalation Design
Good escalation is the difference between a bot that frustrates customers and one they trust:
Trigger
Bot Behaviour
Human Notification
3rd attempt to answer same question
"Let me get someone who can help you better"
Immediate — Slack ping to on-duty agent
Negative sentiment detected
Acknowledge frustration, offer human immediately
Slack ping with sentiment score
Billing / payment issue
Do not attempt — route instantly
High priority ticket creation
Safety / legal language
Immediate escalation, no response attempted
Urgent flag to team lead
30-Day Launch Checklist
Week
Task
Week 1
Ticket audit, top-50 Q&A written, KB structure designed
Week 2
RAG pipeline built, API integrations tested, bot deployed in staging
Week 3
Shadow mode on live traffic, escalation triggers tuned, staff training
Week 4
Go live, daily monitoring, KB gaps filled, CSAT collection active
KPIs to Track Monthly
Target 60%+
Deflection Rate
resolved without human
Target 4.0+
Bot CSAT
post-resolution survey
Target <15%
False Escalation
bots that didn't need to escalate
Track & reduce
Knowledge Gap Rate
unanswered queries per week
Want this implemented for your business?
We scope most projects in 48 hours. Fixed price, 2–4 weeks to deploy.