Small Language Models microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated Dec 10, 2025 • 303k • 1.58k
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated Dec 10, 2025 • 303k • 1.58k
TTS SparkAudio/Spark-TTS-0.5B Text-to-Speech • Updated Mar 7, 2025 • 887 • 725 sesame/csm-1b Text-to-Speech • Updated Dec 1, 2025 • 138k • 2.34k hexgrad/Kokoro-82M Text-to-Speech • Updated Apr 10, 2025 • 9M • • 5.81k speaches-ai/Kokoro-82M-v1.0-ONNX-fp16 Text-to-Speech • Updated Mar 21, 2025 • 2
Small Language Models microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated Dec 10, 2025 • 303k • 1.58k
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated Dec 10, 2025 • 303k • 1.58k
TTS SparkAudio/Spark-TTS-0.5B Text-to-Speech • Updated Mar 7, 2025 • 887 • 725 sesame/csm-1b Text-to-Speech • Updated Dec 1, 2025 • 138k • 2.34k hexgrad/Kokoro-82M Text-to-Speech • Updated Apr 10, 2025 • 9M • • 5.81k speaches-ai/Kokoro-82M-v1.0-ONNX-fp16 Text-to-Speech • Updated Mar 21, 2025 • 2