NeuTTS Air
neuphonic
On-device TTS with instant voice cloning - ~360M active params, GGUF builds for phones, laptops and Raspberry Pi.
What is NeuTTS Air?
NeuTTS is a collection of on-device TTS models by Neuphonic on small LLM backbones with a 50Hz neural codec. NeuTTS-Air (~360M active params, Apache-2.0) does English with instant cloning from 3 seconds of audio; GGUF quantizations run on phones, laptops and single-board computers. Nano adds Spanish/German/French under a more restrictive license.
Pros & Cons
Pros
- GGUF-first design deploys out of the box on Raspberry Pi and Android
- NeuTTS-Air weights are Apache-2.0 - genuinely open for commercial use
- Instant voice cloning from 3 seconds at on-device scale is rare in this weight class
Cons
- Multilingual Nano weights carry a license needing paid commercial use above a revenue threshold
- Apache-licensed Air model is English-only; multilingual needs the restricted Nano
- Small startup; impersonator sites have appeared - verify the source
License
Apache-2.0 (Air weights) (Open weight, with conditions) - model license: NeuTTS Open License v1.0 (Nano)
NeuTTS-Air weights are Apache-2.0; the multilingual NeuTTS-Nano weights use the NeuTTS Open License v1.0 (free for research/limited commercial, paid above a revenue threshold). Verify via neuphonic.com and this GitHub only - impersonator sites exist.
When it is interesting
You need genuinely edge-deployable TTS with cloning for embedded, mobile or compliance-sensitive uses where sending audio to an API is not acceptable.
When it is too early
You need multilingual support under a fully open license or independently verified quality benchmarks.
Commercial alternative & related
- Commercial counterpart: ElevenLabs
This repo featured in the 2026-07 edition of the Open-Source AI Radar.
voicebox
jamiepine
A free, on-device alternative to ElevenLabs for TTS, voice cloning and dictation.
VoxCPM
OpenBMB
Tokenizer-free TTS from OpenBMB covering 30 languages with voice design and real-time streaming.
Chatterbox
resemble-ai
MIT-licensed open TTS with zero-shot voice cloning - 500M params, 23+ languages.