Chatterbox
resemble-ai
MIT-licensed open TTS with zero-shot voice cloning - 500M params, 23+ languages.
What is Chatterbox?
Chatterbox is a family of open TTS models from Resemble AI. The latest Multilingual V3 (500M params) covers 23+ languages with cross-language voice cloning; Chatterbox-Turbo (350M) targets low-latency voice agents. Both support zero-shot cloning from a reference clip, with MIT on code and weights.
Pros & Cons
Pros
- MIT on code and weights - the most permissive license among rising TTS models
- Actively maintained by a well-resourced voice company with rapid iteration
- Multilingual V3 covers 23+ languages with cross-language voice cloning
Cons
- High star count for a roughly one-year-old repo warrants some caution
- Direction may shift with the backing company's commercial priorities
- Quality comparisons are self-reported; independent V3 benchmarks are limited
License
MIT (OSI-open)
When it is interesting
You need MIT-licensed, production-ready multilingual TTS with voice cloning that you can self-host commercially.
When it is too early
You need fully community-verified V3 benchmarks or worry about long-term open-source commitment from a VC-backed company.
Commercial alternative & related
- Commercial counterpart: ElevenLabs
This repo featured in the 2026-07 edition of the Open-Source AI Radar.
voicebox
jamiepine
A free, on-device alternative to ElevenLabs for TTS, voice cloning and dictation.
VoxCPM
OpenBMB
Tokenizer-free TTS from OpenBMB covering 30 languages with voice design and real-time streaming.
supertonic
supertone-inc
Fast on-device TTS via ONNX with 31-language support, running on CPU, browser and mobile.