Open Source

Explore the latest AI open-source projects from GitHub and HuggingFace.

Evermx

Latest AI/LLM news and in-depth reviews.
We analyze usability, potential, and trade-offs.

info@evermx.com

LLM

Claude
Gemini
GPT
Llama
Other LLM

Official Sites

Anthropic (Claude)
Google AI (Gemini)
OpenAI (GPT)
Meta AI (Llama)
Hugging Face

About Editorial Policy Contact Privacy Policy Terms of Service

Reviews Tools Open Source Live Official Profile

VibeVoice - Open Source | Evermx | Evermx

Back to Open Source

Trending

VibeVoice

microsoftMIT

View on GitHub

TTS23.5K Stars2.6K Forks256 views

Microsoft's open-source frontier voice AI combining text-to-speech and automatic speech recognition. Features 60-minute long-form ASR, 90-minute continuous TTS, and real-time streaming with 300ms latency using continuous speech tokenizers at 7.5 Hz frame rate.

Key Features

60-minute long-form ASR in a single pass with speaker identification
90-minute continuous TTS with up to 4 distinct speakers
Real-time streaming TTS with 0.5B parameter model and 300ms latency
Multilingual support covering 50+ languages
Ultra-low 7.5 Hz continuous speech tokenizer with next-token diffusion

Related Projects

TrendingTTS

GitHub

58.9K6.4K

GPT-SoVITS

RVC-Boss

Open-source WebUI for few-shot and zero-shot voice cloning and text-to-speech, producing a usable voice from as little as a 5-second sample.

VibeVoice

Microsoft

Microsoft's MIT-licensed open frontier voice AI: 1.5B long-form TTS up to 90 minutes with 4 speakers, 0.5B streaming TTS at 300 ms latency, and 7B ASR for 60-minute single-pass transcription. 47k+ stars.

ChatTTS

2noise

A dialogue-optimized open TTS model trained on 100,000+ hours that adds fine-grained prosody — laughter, pauses, interjections — with multi-speaker, English/Chinese support.

AGPL-3.0 (code); CC BY-NC 4.0 (model)60

TrendingTTS

GitHub

39.2K4.7K

Bark

suno-ai

Suno's fully generative text-to-audio model — speech, music, and sound effects from one transformer, with nonverbal cues like [laughs] and 100+ voice presets (MIT).

MIT36

Open Source

VibeVoice

Key Features

Tags

Related Projects

GPT-SoVITS

VibeVoice

ChatTTS

Bark