Trending

NeuTTS

neuphonicNOASSERTION

TTS5.1K Stars562 Forks108 views

NeuTTS is the world's first super-realistic on-device text-to-speech speech language model with instant voice cloning, developed by Neuphonic. Built on a compact 0.5B LLM backbone (Qwen 0.5B), it brings natural-sounding speech, real-time performance, and speaker cloning to local devices, unlocking a new category of embedded voice agents, assistants, and compliance-safe applications. The model uses the NeuCodec audio codec that achieves exceptional audio quality at low bitrates using a single codebook.

Key Features

On-device deployment in GGML format for phones, laptops, and Raspberry Pi
Instant voice cloning with as little as 3 seconds of reference audio
Ultra-realistic human-quality speech synthesis powered by 0.5B LLM backbone
NeuCodec audio codec achieving exceptional quality at low bitrates with single codebook
Real-time inference performance optimized for embedded and edge devices

Open Source

NeuTTS

Key Features

Tags

Related Projects

GPT-SoVITS

VibeVoice

ChatTTS

Bark