The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion Paper • 2309.04509 • Published Sep 8, 2023 • 1
Read, Watch and Scream! Sound Generation from Text and Video Paper • 2407.05551 • Published Jul 8, 2024
When Do Diffusion Models learn to Generate Multiple Objects? Paper • 2605.00273 • Published 6 days ago • 6
When Do Diffusion Models learn to Generate Multiple Objects? Paper • 2605.00273 • Published 6 days ago • 6
When Do Diffusion Models learn to Generate Multiple Objects? Paper • 2605.00273 • Published 6 days ago • 6
DISCO: Diversifying Sample Condensation for Efficient Model Evaluation Paper • 2510.07959 • Published Oct 9, 2025 • 15
Diffusion Classifiers Understand Compositionality, but Conditions Apply Paper • 2505.17955 • Published May 23, 2025 • 22
Diffusion Classifiers Understand Compositionality, but Conditions Apply Paper • 2505.17955 • Published May 23, 2025 • 22