CHATTERBOX TTS
Natural Speech
Chatterbox from Resemble AI delivers natural, human-like text-to-speech with zero-shot voice cloning and emotion control. 23 languages and ultra-low latency on Zyka.












Chatterbox — The World's First Ecosystem
Zyka's voice AI with Resemble AI's Chatterbox. Natural speech and voice cloning.

Natural Speech
Chatterbox is Resemble AI's open-source TTS model, trained on 500,000+ hours of audio. In blind tests, a majority of listeners prefer Chatterbox voices for naturalness and clarity. Use it on Zyka.

Zero-Shot Voice Cloning
Replicate any voice from just 5–20 seconds of reference audio. Create personalized voices for narration, characters, and branding without long training. Clone then generate with Chatterbox on Zyka.

Emotion Control
Adjust emotional intensity from calm (0.1) to dramatic (1.0). Paralinguistic tags like [laugh] and [cough] add realism. Supports 23 languages with low latency for interactive apps.
Simplifying the Most Advanced Workflows
Professional TTS with natural delivery.
Provide a Voice
Use a voice you've cloned on Zyka (5–20 seconds of reference audio) or another source. Chatterbox excels at zero-shot cloning and natural delivery.

Enter Script & Emotion
Type your script and optionally set emotion level and paralinguistic tags. Chatterbox delivers with ultra-low latency (under 200ms) for real-time use cases.

Generate Natural Speech
Get human-like speech that listeners prefer in tests. Download for podcasts, games, or IVR. Compare with MiniMax, Qwen3, VoxCPM, and MOSS on Zyka.

Pick Your Plan
Get access to Chatterbox TTS and all Zyka voice models. Choose the plan that fits your needs.
Loading pricing plans...
All plans include our standard features. Need something custom?
Frequently Asked Questions
Everything you need to know about Chatterbox TTS.
G2