Chethan Kumar D A
chethan62
AI & ML interests
tech
Recent Activity
liked a model 10 days ago
XiaomiMiMo/MiMo-V2.5-ASR liked a Space 17 days ago
anycoderapps/VibeVoice-Realtime-0.5B liked a model 17 days ago
microsoft/VibeVoice-Realtime-0.5BOrganizations
None yet
TTS
spaces
- Runtime errorAgentsFeatured2.77k
XTTS
🐸2.77kGenerate speech from text using a reference voice
- Build errorAgents36
Moonshine ASR
🌒36Fast & efficient ASR outperforming Whisper!
- RunningAgents1.16k
Edge TTS Text To Speech
👁1.16kGenerate spoken audio from text using Edge TTS
- PausedAgents855
Video Dubbing (SoniTranslate)
🌍855Video Dubbing with Open Source Projects
papers
-
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Paper • 2311.10093 • Published • 58 -
NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation
Paper • 2311.12229 • Published • 25 -
Diffusion Model Alignment Using Direct Preference Optimization
Paper • 2311.12908 • Published • 49 -
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
Paper • 2312.00845 • Published • 39
STT
TTS
Ai
spaces
- Runtime errorAgentsFeatured2.77k
XTTS
🐸2.77kGenerate speech from text using a reference voice
- Build errorAgents36
Moonshine ASR
🌒36Fast & efficient ASR outperforming Whisper!
- RunningAgents1.16k
Edge TTS Text To Speech
👁1.16kGenerate spoken audio from text using Edge TTS
- PausedAgents855
Video Dubbing (SoniTranslate)
🌍855Video Dubbing with Open Source Projects
webgpu
papers
-
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Paper • 2311.10093 • Published • 58 -
NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation
Paper • 2311.12229 • Published • 25 -
Diffusion Model Alignment Using Direct Preference Optimization
Paper • 2311.12908 • Published • 49 -
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
Paper • 2312.00845 • Published • 39
models