Choose which LLM powers your agent – including PolyAI Raven, our proprietary model family built for conversational AI.
Choose the LLM that powers your agent. The model directly affects response quality, latency, and cost – a wrong choice can make your agent slow, expensive, or inaccurate.
Recommended: Raven 3.5. PolyAI’s purpose-built conversational model — 24+ languages, sub-300 ms latency, powers both voice and chat.
PolyAI’s proprietary Raven model family is specialized for customer service across voice and chat. Raven eliminates the trade-off between speed and accuracy – delivering sub-300ms latency, fewer errors, and more natural responses than general-purpose LLMs.Raven 3.5 is the recommended model for all deployments. It supports voice and chat, 24+ languages, and has powerful auto-reasoning, out-of-domain detection, custom style following, and built-in safety.
Learn more about Raven
Full details on capabilities, supported languages, and why Raven is recommended for most deployments.
Model
Good for
GPT-5.2
High-quality interactions requiring nuance and strong reasoning
GPT-5.2 chat
Extended dialogue and conversational stability
GPT-5 mini
Lower latency and reduced cost for mid-complexity use cases
GPT-5 nano
Simple tasks and fast-response workloads
GPT-4o
Versatile balance of reasoning, speed, and cost
GPT-4o mini
Everyday queries and high-volume deployments
GPT-4.1
Strong reasoning with improved cross-task performance
GPT-4.1 mini
Cost-effective, latency-focused for lighter workloads