Trending

FunASR

modelscopeMIT

STT15.5K Stars1.6K Forks170 views

FunASR is a fundamental end-to-end speech recognition toolkit developed by Alibaba DAMO Academy (ModelScope), bridging academic research and industrial applications. It provides 20+ industrial-grade pretrained models including Paraformer, SenseVoice, and Fun-ASR-Nano, supporting both streaming and non-streaming ASR, voice activity detection (VAD), punctuation restoration, speaker diarization, emotion recognition, and keyword spotting. Fun-ASR-Nano supports 31 languages with low-latency real-time transcription trained on tens of millions of hours of real speech data.

Key Features

20+ industrial-grade pretrained ASR models (Paraformer, SenseVoice, Fun-ASR-Nano)
Real-time streaming and batch offline speech recognition
Voice Activity Detection (VAD) and speaker diarization
Multilingual support for 31+ languages including Mandarin, English, Japanese, Korean
Punctuation restoration, emotion recognition, and keyword spotting

Open Source

FunASR

Key Features

Tags

Related Projects

whisper.cpp

Handy

faster-whisper

WhisperX