Open Source
Explore the latest AI open-source projects from GitHub and HuggingFace.
Explore the latest AI open-source projects from GitHub and HuggingFace.
Microsoft's open-source frontier voice AI combining text-to-speech and automatic speech recognition. Features 60-minute long-form ASR, 90-minute continuous TTS, and real-time streaming with 300ms latency using continuous speech tokenizers at 7.5 Hz frame rate.
Microsoft
Microsoft's MIT-licensed open frontier voice AI: 1.5B long-form TTS up to 90 minutes with 4 speakers, 0.5B streaming TTS at 300 ms latency, and 7B ASR for 60-minute single-pass transcription. 47k+ stars.
resemble-ai
Family of SoTA open-source TTS models by Resemble AI with zero-shot voice cloning, 23+ language support, and paralinguistic controls across 350M-500M parameter variants.
OpenBMB
OpenBMB's tokenizer-free 2B-parameter TTS model emitting native 48kHz audio across 30 languages with voice design, controllable cloning, and an OpenAI-compatible endpoint.
index-tts
Industrial-grade zero-shot TTS with precise duration control and emotion disentanglement