Provider classes
When picking models, adjusting stability, or accessing third-party providers — use provider-specific TTSVoice classes. You can also optionally adjust clarity and latency_mode to match the agent’s interaction style.Example: ElevenLabs
Example: AWS Polly
from polyai.voice import PollyVoiceExample: Microsoft Azure TTS
Example: Cartesia
EmotionKind:ANGER,POSITIVITY,SURPRISEEmotionIntensity:LOWEST,LOW,HIGH,HIGHEST
Example: Rime
Example: Minimax
Example: Hume
Example: Google TTS
Example: Custom provider
Voice randomization
UseVoiceWeighting to randomly select a voice based on weighted probabilities:
- Weights must sum to 1.0.
- Voices without explicit weights share the remaining probability equally.
Cache behavior
- Changing
model_iddoes not automatically invalidate cached audio. - To reset cached audio:
- Go to Audio → Cache and delete existing entries.
- Or, create a new voice entry with a different voice_id.
- You can prepend the model ID to the voice ID (e.g.
eleven_flash_v2/a1b2c3...) if you want to isolate caches across models.
Additional options
- clarity – fine-tunes articulation sharpness per utterance (0.0–1.0).
- latency_mode – chooses a response profile (“swift”, “balanced”, “precise”, “turbo”) consistent with Interaction style.
- stability – controls tone variability across runs.
- randomize_voice() – supports external providers for weighted selection.

