Performance monitoring

For quick QA workflows and daily health checks, see QA and analytics. This page covers in-depth performance analysis and optimization.

Monitoring your agent’s performance helps you identify issues before they impact customers. This page covers how to track key metrics, diagnose problems, and optimize performance.

Quick reference

I need to…	Action	Time estimate
Check overall agent performance	Analytics → Performance dashboard	2 min
Identify slow responses	Conversation Review → Filter by latency	5 min
Debug ASR accuracy issues	Diagnosis → Check transcription	5 min
Find knowledge gaps	Analytics → Unhandled queries	10 min
Monitor handoff rates	Analytics → Handoff metrics	3 min
Check function errors	Conversation Review → Filter by errors	5 min
Review call quality	Listen to recordings	10-30 min
Track version performance	Compare versions in Analytics	10 min

Key performance metrics

Response latency

What it measures: Time from when the user stops speaking to when the agent starts responding. Target: < 2 seconds for most queries How to check:

Go to Analytics → Performance
Review average response time
Filter by time period, environment, or variant

Common causes of high latency:

Slow function execution
Complex knowledge retrieval
External API delays
Network issues
Overly complex agent logic

ASR (Automatic Speech Recognition) accuracy

What it measures: How accurately the agent transcribes what the caller says. Target: > 95% word accuracy How to check:

Go to Conversation Review
Review transcriptions
Compare to audio recordings
Look for patterns in errors

Common causes of low accuracy:

Background noise
Strong accents or dialects
Poor phone connection
Uncommon words or jargon
Fast or unclear speech

Containment rate

What it measures: Percentage of calls handled without human handoff. Target: Varies by use case (typically 60-90%) How to check:

Go to Analytics → Handoffs
Review handoff rate over time
Analyze reasons for handoffs

Common causes of low containment:

Knowledge gaps
Complex queries beyond agent capability
Caller preference for human
Technical issues or errors

Task completion rate

What it measures: Percentage of calls where the agent successfully completes the intended task. Target: > 80% How to check:

Define success criteria (booking made, question answered, etc.)
Use Test Sets to track completion
Review Conversation Review for outcomes

Common causes of low completion:

Unclear agent responses
Missing information
Function failures
Poor conversation flow

Monitoring tools

Analytics dashboard

The Analytics dashboard provides high-level metrics:

Call volume - Total calls over time
Average call duration - How long calls last
Handoff rate - Percentage transferred to humans
Top intents - Most common user requests
Performance trends - Latency and quality over time

How to use it:

Go to Analytics
Select date range
Filter by environment, variant, or version
Export data for deeper analysis

Conversation Review

Conversation Review lets you drill into individual calls:

Search and filter - Find specific calls or patterns
Listen to recordings - Hear actual conversations
Review transcriptions - See what was said
Check Diagnosis - View technical details

How to use it:

Go to Conversation Review
Apply filters (date, handoff, error, etc.)
Select a call to review
Listen and analyze

Diagnosis panel

The Diagnosis panel shows technical details for each call:

Function logs - Output from conv.log
Knowledge retrieval - Which sources were used
LLM prompts and responses - What the model generated
Timing information - Latency breakdown
Errors and warnings - Technical issues

How to use it:

Open a call in Conversation Review
Click Diagnosis tab
Review logs and timing
Identify bottlenecks or errors

Test Sets

Test Sets provide automated performance tracking:

Regression testing - Ensure changes don’t break existing functionality
Version comparison - Compare performance across versions
Continuous monitoring - Run tests on schedule

How to use it:

Create Test Cases for key scenarios
Group into Test Sets
Run automatically on publish/promote
Review results and trends

Identifying performance issues

High latency

Symptoms:

Long pauses before agent responds
Callers repeat themselves
Complaints about slow service

Diagnosis steps:

Go to Conversation Review → Filter by high latency
Review Diagnosis → Check timing breakdown
Identify the bottleneck:
- Function execution - Slow API calls or complex logic
- Knowledge retrieval - Large or slow sources
- LLM generation - Complex prompts or long responses
- TTS generation - Slow voice provider

Solutions:

Optimize slow functions (see Function maintenance)
Use cached audio for common phrases
Switch to faster TTS provider (e.g., Cartesia)
Simplify knowledge sources
Add delay controls to manage user expectations

Low ASR accuracy

Symptoms:

Agent misunderstands callers
Frequent “I didn’t catch that” responses
Incorrect information provided

Diagnosis steps:

Review transcriptions in Conversation Review
Compare to audio recordings
Identify patterns:
- Specific words consistently wrong
- Certain accents or dialects problematic
- Background noise interference

Solutions:

Add custom vocabulary for domain-specific terms
Adjust ASR sensitivity settings
Use noise cancellation features
Test with different ASR models
Add clarification prompts for ambiguous input

Knowledge gaps

Symptoms:

Agent says “I don’t know”
High handoff rates for specific topics
Callers frustrated by lack of information

Diagnosis steps:

Go to Analytics → Unhandled queries
Review common questions without answers
Check Conversation Review for “no knowledge” patterns

Solutions:

Add missing topics to Managed Topics
Add relevant Connected knowledge sources
Update existing knowledge with more detail
Improve knowledge retrieval with better descriptions

Function errors

Symptoms:

Agent apologizes for technical issues
Tasks fail to complete
Error messages in Diagnosis logs

Diagnosis steps:

Go to Conversation Review → Filter by errors
Review Diagnosis → Check function logs
Identify error patterns:
- API failures
- Authentication issues
- Timeout errors
- Invalid parameters

Solutions:

Fix function code (see Function maintenance)
Update API credentials
Add error handling and retries
Optimize for performance
Add logging for better debugging

High handoff rates

Symptoms:

Many calls transferred to humans
Low containment rate
Specific topics always escalate

Diagnosis steps:

Go to Analytics → Handoffs
Review handoff reasons
Identify patterns:
- Specific intents always handoff
- Knowledge gaps
- Caller preference
- Technical failures

Solutions:

Add knowledge for common handoff reasons
Improve agent capabilities for those scenarios
Adjust handoff rules to be less aggressive
Add self-service options before handoff

Optimizing performance

Reducing latency

Quick wins:

Enable audio caching for common phrases
Switch to Cartesia for faster TTS
Use Turbo or Swift interaction mode
Optimize slow functions
Cache API responses

Advanced optimizations:

Use delay controls for long-running functions
Implement async operations
Reduce knowledge source size
Simplify LLM prompts
Pre-compute common responses

Improving ASR accuracy

Quick wins:

Add custom vocabulary for domain terms
Use clarification prompts for ambiguous input
Enable noise cancellation
Test with different ASR models

Advanced optimizations:

Train custom ASR models (enterprise feature)
Use context-aware ASR
Implement spelling confirmation for critical data
Add phonetic alternatives for names

Increasing containment

Quick wins:

Add missing knowledge
Improve handoff rules
Add self-service options
Clarify agent capabilities upfront

Advanced optimizations:

Use functions for complex tasks
Implement multi-step workflows
Add proactive suggestions
Personalize based on caller history

Enhancing quality

Quick wins:

Fix pronunciation issues
Improve response clarity
Add personality and warmth
Test with real users

Advanced optimizations:

A/B test different approaches
Use sentiment analysis
Implement dynamic responses
Personalize based on context

Monitoring workflows

Daily health check

Review Analytics dashboard for anomalies
Check handoff rate vs. baseline
Review recent errors in Conversation Review
Verify no critical function failures
Check for ASR accuracy issues

Time: 10-15 minutes

Weekly performance review

Analyze performance trends over the week
Review top unhandled queries
Listen to sample calls (5-10)
Identify patterns in handoffs
Check function performance
Review Test Set results
Plan improvements for next week

Time: 30-60 minutes

Monthly deep dive

Compare performance month-over-month
Analyze containment rate trends
Review all handoff reasons
Audit knowledge coverage
Optimize slow functions
Update Test Sets
Plan major improvements

Time: 2-4 hours

Version comparison

When deploying a new version:

Run Test Sets on both versions
Compare key metrics:
- Response latency
- Containment rate
- Task completion
- Error rates
Review sample calls from each version
Identify regressions or improvements
Decide whether to promote or rollback

Time: 30-60 minutes

Troubleshooting common issues

Sudden latency spike

Possible causes:

External API slowdown
Increased call volume
New function introduced
Network issues

Steps:

Check Analytics for timing breakdown
Review recent changes (functions, knowledge)
Test external APIs directly
Check system status
Rollback if needed

ASR accuracy drop

Possible causes:

ASR model change
New vocabulary not recognized
Increased background noise
Phone line quality issues

Steps:

Review recent transcriptions
Compare to previous period
Check for new terms or patterns
Test with different ASR settings
Add custom vocabulary if needed

Containment rate decline

Possible causes:

Knowledge gaps
New caller expectations
Seasonal changes
Agent changes

Steps:

Review handoff reasons
Identify new patterns
Add missing knowledge
Adjust handoff rules
Test improvements

Function failure spike

Possible causes:

API changes
Credential expiration
Rate limiting
Code bugs

Steps:

Review function logs
Test APIs directly
Check credentials
Review recent code changes
Fix and redeploy

Best practices

Proactive monitoring

Set up alerts - Get notified of anomalies using the Alerts API to monitor latency, errors, and call volume
Review daily - Catch issues early
Track trends - Identify gradual degradation
Compare versions - Ensure changes improve performance

Data-driven optimization

Use Test Sets - Measure impact of changes
A/B test - Compare approaches
Listen to calls - Understand real user experience
Analyze patterns - Find root causes, not symptoms

Continuous improvement

Regular reviews - Weekly and monthly check-ins
Prioritize issues - Focus on high-impact problems
Iterate quickly - Make small, frequent improvements
Document learnings - Build institutional knowledge

Collaboration

Share insights - Keep team informed
Involve stakeholders - Get feedback from business owners
Escalate issues - Don’t let problems linger
Celebrate wins - Recognize improvements

QA and analytics - Using analytics to improve your agent
Function maintenance - Optimizing function performance
Voice and audio updates - Improving audio quality
Health checks - Proactive monitoring routines
Alerts API - Set up automated alerts for operational metrics

PolyAcademy

Maintain

Performance monitoring

Quick reference

Key performance metrics

Response latency

ASR (Automatic Speech Recognition) accuracy

Containment rate

Task completion rate

Monitoring tools

Analytics dashboard

Conversation Review

Diagnosis panel

Test Sets

Identifying performance issues

High latency

Low ASR accuracy

Knowledge gaps

Function errors

High handoff rates

Optimizing performance

Reducing latency

Improving ASR accuracy

Increasing containment

Enhancing quality

Monitoring workflows

Daily health check

Weekly performance review

Monthly deep dive

Version comparison

Troubleshooting common issues

Sudden latency spike

ASR accuracy drop

Containment rate decline

Function failure spike

Best practices

Proactive monitoring

Data-driven optimization

Continuous improvement

Collaboration

PolyAcademy

Maintain

​Quick reference

​Key performance metrics

​Response latency

​ASR (Automatic Speech Recognition) accuracy

​Containment rate

​Task completion rate

​Monitoring tools

​Analytics dashboard

​Conversation Review

​Diagnosis panel

​Test Sets

​Identifying performance issues

​High latency

​Low ASR accuracy

​Knowledge gaps

​Function errors

​High handoff rates

​Optimizing performance

​Reducing latency

​Improving ASR accuracy

​Increasing containment

​Enhancing quality

​Monitoring workflows

​Daily health check

​Weekly performance review

​Monthly deep dive

​Version comparison

​Troubleshooting common issues

​Sudden latency spike

​ASR accuracy drop

​Containment rate decline

​Function failure spike

​Best practices

​Proactive monitoring

​Data-driven optimization

​Continuous improvement

​Collaboration

​Related pages

Quick reference

Key performance metrics

Response latency

ASR (Automatic Speech Recognition) accuracy

Containment rate

Task completion rate

Monitoring tools

Analytics dashboard

Conversation Review

Diagnosis panel

Test Sets

Identifying performance issues

High latency

Low ASR accuracy

Knowledge gaps

Function errors

High handoff rates

Optimizing performance

Reducing latency

Improving ASR accuracy

Increasing containment

Enhancing quality

Monitoring workflows

Daily health check

Weekly performance review

Monthly deep dive

Version comparison

Troubleshooting common issues

Sudden latency spike

ASR accuracy drop

Containment rate decline

Function failure spike

Best practices

Proactive monitoring

Data-driven optimization

Continuous improvement

Collaboration

Related pages