Open Source
Explore the latest AI open-source projects from GitHub and HuggingFace.
Explore the latest AI open-source projects from GitHub and HuggingFace.
NeuTTS is the world's first super-realistic on-device text-to-speech speech language model with instant voice cloning, developed by Neuphonic. Built on a compact 0.5B LLM backbone (Qwen 0.5B), it brings natural-sounding speech, real-time performance, and speaker cloning to local devices, unlocking a new category of embedded voice agents, assistants, and compliance-safe applications. The model uses the NeuCodec audio codec that achieves exceptional audio quality at low bitrates using a single codebook.
Microsoft
Microsoft's MIT-licensed open frontier voice AI: 1.5B long-form TTS up to 90 minutes with 4 speakers, 0.5B streaming TTS at 300 ms latency, and 7B ASR for 60-minute single-pass transcription. 47k+ stars.
microsoft
Open-source frontier voice AI for TTS and ASR
resemble-ai
Family of SoTA open-source TTS models by Resemble AI with zero-shot voice cloning, 23+ language support, and paralinguistic controls across 350M-500M parameter variants.
OpenBMB
OpenBMB's tokenizer-free 2B-parameter TTS model emitting native 48kHz audio across 30 languages with voice design, controllable cloning, and an OpenAI-compatible endpoint.