Skip to main content
Update voice settings, fix mispronunciations, manage cached audio, and tune call behavior to maintain high-quality voice experiences for your callers.

Quick reference

I need toโ€ฆAction
Change the agentโ€™s voiceVoice > Settings โ†’ Change
Adjust voice parametersVoice > Settings โ†’ settings gear
Fix a mispronunciationVoice > Advanced settings > Speech โ†’ Pronunciation
Update cached audioVoice > Audio library โ†’ Edit โ†’ Sync
Enable/disable barge-inVoice > Advanced settings > Call โ†’ toggle
Upload custom audioVoice > Audio library โ†’ Upload

Changing your agentโ€™s voice

Consider updating when your brand refreshes, customers report clarity issues, youโ€™re expanding to new languages, or newer voice models become available.
  1. Go to Voice > Settings
  2. Click Change to open the Voice Library
  3. Filter by Language, Region, and Gender
  4. Preview voices with custom text
  5. Click Select to apply
  6. Test in Agent Chat before publishing
For non-English projects, use a multilingual_v2 model to ensure proper language support.
For programmatic voice configuration, see voice classes and Add a voice.

Barge-in

Toggle in Voice > Advanced settings > Call. Lets callers interrupt the agent mid-sentence. Enable when: callers frequently interrupt, or you want more natural conversations. Disable when: delivering complete information (legal disclaimers), background noise causes false interruptions.

Managing audio quality

Cached audio

The Audio library tab lets you cache and optimize frequently-used audio for reduced latency and consistent quality.
  • Open Voice > Audio library
  • Click Edit to adjust stability/clarity settings or add IPA pronunciation corrections
  • Click the sync icon to regenerate, then preview
Audio is only cached after the same TTS is generated at least twice in 24 hours. For critical phrases (greetings, transfers), generate them repeatedly or upload manually.

Custom audio uploads

Upload pre-recorded audio (WAV or MP3) for maximum control over greetings, legal disclaimers, or brand-specific moments.

Fixing pronunciations

When the agent mispronounces words:
  1. Go to Voice > Advanced settings > Speech โ†’ Pronunciation section
  2. Add a pronunciation rule
  3. Enter the regex pattern for the word as it appears in text
  4. Provide the IPA replacement (e.g., โ€œPolyAIโ€ โ†’ /หˆpษ’li eษช aษช/)
  5. Test in Agent Chat
You can also use SSML for advanced control:
<break time="500ms"/>
<prosody rate="slow">Speak this slowly</prosody>

Troubleshooting

IssueLikely causeFix
Voice sounds roboticLow-quality TTSSwitch to Cartesia or ElevenLabs
Agent speaks too fastRate set too highAdjust with the settings gear in Voice Settings
Agent interrupts frequentlyBarge-in too sensitiveDisable barge-in in Advanced settings > Call
MispronunciationsTTS doesnโ€™t recognize wordAdd pronunciation rule in Advanced settings > Speech
High latencySlow TTS providerSwitch to Cartesia or use cached audio
Background noise interruptionsBarge-in too sensitiveDisable barge-in or increase speech end delay

Maintenance routine

  • Monthly: Listen to recent calls and identify voice quality issues
  • As needed: Add pronunciations for new terms
  • After voice changes: Regenerate cached audio
Last modified on June 29, 2026