Open weight, with conditions

Open voice and text-to-speech

supertonic

supertone-inc

Fast on-device TTS via ONNX with 31-language support, running on CPU, browser and mobile.

11.3k stars (as of 2026-06-05)

Overview

What is supertonic?

supertonic is a fast on-device text-to-speech (TTS) system that runs natively through ONNX and uses a compact model of roughly 99M parameters. It covers 31 languages, runs on CPU without a GPU, and can run inference in the browser via WebGPU.

Analysis

Pros & Cons

Pros

Runs on CPU (Raspberry Pi, mobile, browser) with no network connection needed
Real-time (a whole web page narrated in under a second)
SDKs across Python, Node, browser, Java, C++, Swift, iOS, Rust, Flutter
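
A "real-time" claim like the one above is commonly quantified as a real-time factor (RTF): synthesis time divided by the duration of the audio produced, where a value below 1.0 means faster than real time. The following is a minimal sketch for measuring it, not supertonic's own API: `real_time_factor`, `measure_rtf`, and `fake_synthesize` are hypothetical names, and the stand-in synthesizer only returns silence, so the printed number says nothing about supertonic itself.

```python
import time
from typing import Callable, Sequence


def real_time_factor(synthesis_seconds: float, audio_seconds: float) -> float:
    """Synthesis time divided by audio duration; below 1.0 is faster than real time."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


def measure_rtf(
    synthesize: Callable[[str], Sequence[float]],
    text: str,
    sample_rate: int,
) -> float:
    """Time one call to `synthesize` and compare it with the length of its output.

    `synthesize` is an assumed callable that returns raw audio samples for `text`.
    """
    start = time.perf_counter()
    samples = synthesize(text)
    elapsed = time.perf_counter() - start
    return real_time_factor(elapsed, len(samples) / sample_rate)


def fake_synthesize(text: str) -> list:
    # Stand-in for a real TTS call: 0.1 s of silence per character at 16 kHz.
    return [0.0] * (len(text) * 1600)


if __name__ == "__main__":
    rtf = measure_rtf(fake_synthesize, "hello world", sample_rate=16000)
    print(f"RTF = {rtf:.4f} (below 1.0 means faster than real time)")
```

To measure an actual engine, replace `fake_synthesize` with a wrapper around the SDK call that returns the generated samples, and pass the sample rate that engine outputs.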

Cons

No built-in voice cloning in the open variant (fixed voice only)
The model is OpenRAIL-M, so this is open-weight with use restrictions, not fully OSI-open
78 open issues at time of writing

License

Code: MIT. Model: OpenRAIL-M (open weight, with conditions).