Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs Paper • 2603.16932 • Published 13 days ago • 83
ibm-granite/granite-4.0-1b-speech Automatic Speech Recognition • 2B • Updated 10 days ago • 46.1k • 192
Continuous Speech Synthesis using per-token Latent Diffusion Paper • 2410.16048 • Published Oct 21, 2024 • 30