ASR nvidia/nemotron-speech-streaming-en-0.6b Automatic Speech Recognition • Updated 24 days ago • 49.1k • 518
INDIC TTS DATASETS My own collection of TTS datasets for fine-tuning models on Indic languages. edwixx/Gujarati40h Updated Oct 18, 2024 • 6 edwixx/Tamil200hours Updated May 7, 2024 • 43
Audio Models Collection of the best text-to-audio models. stabilityai/stable-audio-open-1.0 Text-to-Audio • Updated Jun 19, 2025 • 23k • 1.43k Running on Zero 325 TangoFlux 🚀 325 Text to Audio (Sound SFX) Generator openbmb/MiniCPM-o-2_6 Any-to-Any • 9B • Updated Oct 5, 2025 • 111k • 1.28k
TTS Collection of some of the TTS models I found cool. SWivid/F5-TTS Text-to-Speech • Updated Mar 21, 2025 • 690k • 1.16k fishaudio/fish-speech-1.4 Text-to-Speech • Updated Nov 5, 2024 • 763 • 457 coqui/XTTS-v2 Text-to-Speech • Updated Dec 11, 2023 • 6.26M • 3.46k microsoft/speecht5_tts Text-to-Speech • Updated Nov 8, 2023 • 246k • 827