Open Source
Explore the latest AI open-source projects from GitHub and HuggingFace.
Explore the latest AI open-source projects from GitHub and HuggingFace.
Higgs Audio is an expressive audio foundation model from Boson AI, pretrained on over 10 million hours of audio and text data, capable of generating natural multi-speaker dialogues, melodic humming with cloned voices, and simultaneous speech with background music. The latest V2.5 release condenses the architecture to 1B parameters while surpassing the prior 3B model in speed and accuracy through Group Relative Policy Optimization (GRPO) alignment on a curated Voice Bank dataset. It achieves state-of-the-art results on EmergentTTS-Eval, outperforming GPT-4o-mini-TTS on expressive emotion and intonation tasks.
RVC-Project
The de facto open-source voice-conversion framework — trains a usable voice clone from 10 minutes of audio via top-1 feature retrieval, with a ~170ms real-time GUI (MIT).
myshell-ai
Instant voice cloning framework by MIT and MyShell with 36k+ GitHub stars, enabling zero-shot cross-lingual voice replication from just seconds of reference audio.
Jamie Pine
A local-first, open-source AI voice studio that clones voices, generates speech across 23 languages and 7 TTS engines, dictates into any app, and gives MCP agents a voice you own.
Anjok07
Deep-learning desktop GUI for vocal and stem separation from any audio track.