How to Measure Customer Service Performance With AI
The Five Core Metrics
1. AI Resolution Rate
This is the percentage of customer conversations that the AI resolves completely without escalating to a human agent. A well-trained AI chatbot should resolve 60-80% of conversations after the first month. Track this number weekly and watch for trends. If it plateaus below 60%, your knowledge base has gaps. If it drops suddenly, something changed, maybe your product, pricing, or policies updated and the training data is stale.
To calculate: count the total conversations the AI handled and subtract the ones that escalated to a human. Divide by total conversations. A conversation counts as "resolved" if the customer got an answer and did not ask for a human.
2. Average Response Time
AI response time is nearly instant, typically under 3 seconds. The metric that matters more is the blended response time across AI and human-handled conversations. If your AI handles 70% of conversations instantly and humans handle 30% with an average 5-minute wait, your blended average is about 1.5 minutes. Track the human response time separately to identify staffing gaps and peak volume periods.
3. Customer Satisfaction (CSAT)
Ask customers to rate their experience after each conversation. A simple thumbs up/thumbs down or 1-5 star rating at the end of the chat works well. Track CSAT separately for AI-resolved conversations and human-resolved conversations. If AI CSAT is significantly lower than human CSAT, the AI is giving unsatisfying answers and needs better training data or a refined system prompt.
4. Escalation Rate
The percentage of conversations that transfer from AI to a human agent. This is the inverse of AI resolution rate, but worth tracking on its own because you can break it down by reason: customer requested a human, AI could not find an answer, AI detected a complex issue, or the conversation hit a pre-defined escalation trigger. Understanding why conversations escalate tells you what to fix.
5. Knowledge Base Gap Rate
How often the AI responds with something like "I do not have information about that" or falls back to a generic response because it cannot find relevant content. Every gap represents a category of questions that your knowledge base does not cover. Track these gaps and prioritize filling the most frequent ones. See How to Improve AI Customer Service Accuracy.
How to Track These Metrics
Conversation data is stored in the platform's conversation records, which capture every message, timestamp, channel, and outcome. You can review conversations through the Live Operator Chat inbox and use the chatbot analytics features to see aggregate metrics.
For a more detailed view, set up a workflow that runs daily, queries the conversation data for the past 24 hours, calculates the five metrics, and sends a summary report to your team. This gives you a daily snapshot without manual effort.
Benchmarks by Business Type
- E-commerce: AI resolution rate of 70-85% (product questions and order status are highly automatable). Response time under 5 seconds for AI, under 2 minutes for human.
- SaaS: AI resolution rate of 55-70% (technical questions are more varied). Response time under 5 seconds for AI, under 5 minutes for human.
- Healthcare: AI resolution rate of 40-60% (many questions require privacy-sensitive handling). CSAT should be tracked carefully since incorrect medical information has serious consequences.
- Professional services: AI resolution rate of 50-65% (client questions often require judgment). Focus on escalation speed rather than pure resolution rate.
Using Metrics to Improve
Metrics are only useful if you act on them. Review your numbers weekly and look for actionable patterns:
- If AI resolution rate is below 60%, your knowledge base needs more content. Review escalated conversations to find what topics are missing.
- If CSAT for AI conversations is low, review the actual responses. The AI might be technically correct but too brief, too formal, or missing context that would make the answer more helpful.
- If escalation rate spikes on certain days, check if something changed (new product launch, pricing update, outage) and update the knowledge base immediately.
- If human response time is increasing, you may need to add agents during peak hours or improve the AI to handle more of the load.
Track your support performance across AI and human channels. Set up automated reporting and improve your resolution rate over time.
Get Started Free