Open Source
Explore the latest AI open-source projects from GitHub and HuggingFace.
Explore the latest AI open-source projects from GitHub and HuggingFace.
FunASR is a fundamental end-to-end speech recognition toolkit developed by Alibaba DAMO Academy (ModelScope), bridging academic research and industrial applications. It provides 20+ industrial-grade pretrained models including Paraformer, SenseVoice, and Fun-ASR-Nano, supporting both streaming and non-streaming ASR, voice activity detection (VAD), punctuation restoration, speaker diarization, emotion recognition, and keyword spotting. Fun-ASR-Nano supports 31 languages with low-latency real-time transcription trained on tens of millions of hours of real speech data.
ggml-org
Pure C/C++ port of OpenAI Whisper for edge deployment
CJ Pais
A free, open-source, cross-platform speech-to-text app that transcribes your voice entirely offline — press a shortcut, speak, and have the text pasted into any app.
SYSTRAN
A CTranslate2-based reimplementation of OpenAI's Whisper that runs up to 4x faster at the same accuracy with lower memory, adding 8-bit quantization, batched inference, and word-level timestamps. MIT-licensed and FFmpeg-free.
m-bain
A fast open ASR system that wraps Whisper to add accurate word-level timestamps, 70x real-time batched inference, and speaker diarization for subtitles and meetings.