Open Source
Explore the latest AI open-source projects from GitHub and HuggingFace.
Explore the latest AI open-source projects from GitHub and HuggingFace.
Microsoft's open-source frontier voice AI combining text-to-speech and automatic speech recognition. Features 60-minute long-form ASR, 90-minute continuous TTS, and real-time streaming with 300ms latency using continuous speech tokenizers at 7.5 Hz frame rate.
Sesame AI Labs
Sesame AI Labs' open-source 1B-parameter conversational speech model using Llama architecture — natural human-like intonation, multi-speaker support, HuggingFace Transformers native.
hexgrad
Lightweight 82M parameter open-weight TTS model delivering quality comparable to larger models, with Apache 2.0 licensing and 9-language support.