A powerful speech recognition toolkit that offers real-time processing, supports multiple languages, and provides speaker diarization and emotion detection features.
Discovered on GitHub via GitHub:modelscope