Reviews AI Tools Open Source Live News AI Official

Reviews AI Tools Open Source Live News AI Official

Open Source

Explore the latest AI open-source projects from GitHub and HuggingFace.

Latest AI/LLM news and in-depth reviews.
We analyze usability, potential, and trade-offs.

info@evermx.com

LLM

Claude
Gemini
GPT
Llama
Other LLM

More Content

AI Tools
Open Source
IT News
Tutorials
Research

Official Sites

Anthropic (Claude)
Google AI (Gemini)
OpenAI (GPT)
Meta AI (Llama)
Hugging Face

© 2026 Evermx. All rights reserved.

About Editorial Policy Contact Privacy Policy Terms of Service

Reviews Tools Open Source Live Official Profile

Supertonic - Open Source | Evermx | Evermx

Back to Open Source

Supertonic

Trending

Supertonic

supertone-incMIT

TTS11.2K Stars1.2K Forks67 views

Supertonic is a lightning-fast, on-device multilingual text-to-speech engine from Supertone that runs natively through ONNX Runtime. It is designed to be lightweight enough to ship inside mobile apps, web browsers (via WebGPU), and edge devices without server dependencies. The project provides first-class bindings for Swift, Kotlin, Python, Rust, Go, C++, Flutter, Node.js, and the web.

Key Features

On-device multilingual speech synthesis with zero server dependency
ONNX Runtime backend with WebGPU acceleration for the browser
Cross-platform bindings for Swift, Kotlin, Python, Rust, Go, C++, Flutter, and Node.js
Lightweight footprint designed for mobile, edge, and embedded deployment

Tags

text-to-speechttson-deviceonnxmultilingualspeech-synthesiswebgpu

Related Projects

GPT-SoVITS

GPT-SoVITS

RVC-Boss

Open-source WebUI for few-shot and zero-shot voice cloning and text-to-speech, producing a usable voice from as little as a 5-second sample.

VibeVoice

VibeVoice

Microsoft

Microsoft's MIT-licensed open frontier voice AI: 1.5B long-form TTS up to 90 minutes with 4 speakers, 0.5B streaming TTS at 300 ms latency, and 7B ASR for 60-minute single-pass transcription. 47k+ stars.

ChatTTS

ChatTTS

2noise

A dialogue-optimized open TTS model trained on 100,000+ hours that adds fine-grained prosody — laughter, pauses, interjections — with multi-speaker, English/Chinese support.

AGPL-3.0 (code); CC BY-NC 4.0 (model)65

Bark

Bark

suno-ai

Suno's fully generative text-to-audio model — speech, music, and sound effects from one transformer, with nonverbal cues like [laughs] and 100+ voice presets (MIT).