h1t/TCD-SDXL-LoRA
Text-to-Image • Updated • 1.31k • 117
Transcribe audio to text in various languages
Text-to-speech (TTS) with Next-gen Kaldi
Generate voice with text or audio input
Generate high-quality speech from text using an audio prompt
Generate speech in a cloned voice from a short audio sample
Generate a talking face video from an image and audio