JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization Paper • 2511.23002 • Published Nov 28, 2025 • 26
OmniPSD: Layered PSD Generation with Diffusion Transformer Paper • 2512.09247 • Published 26 days ago • 46
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper • 2512.08765 • Published 26 days ago • 128
UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios Paper • 2511.18050 • Published Nov 22, 2025 • 37
view article Article We’re open-sourcing our text-to-image model and the process behind it Nov 12, 2025 • 76
Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback Paper • 2510.16888 • Published Oct 19, 2025 • 21
SwarmSys: Decentralized Swarm-Inspired Agents for Scalable and Adaptive Reasoning Paper • 2510.10047 • Published Oct 11, 2025 • 13
LucidFlux: Caption-Free Universal Image Restoration via a Large-Scale Diffusion Transformer Paper • 2509.22414 • Published Sep 26, 2025 • 21
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21, 2025 • 259
An Empirical Study of GPT-4o Image Generation Capabilities Paper • 2504.05979 • Published Apr 8, 2025 • 64
Temporal Regularization Makes Your Video Generator Stronger Paper • 2503.15417 • Published Mar 19, 2025 • 22
MusicInfuser: Making Video Diffusion Listen and Dance Paper • 2503.14505 • Published Mar 18, 2025 • 11
STEVE: AStep Verification Pipeline for Computer-use Agent Training Paper • 2503.12532 • Published Mar 16, 2025 • 17
CoRe^2: Collect, Reflect and Refine to Generate Better and Faster Paper • 2503.09662 • Published Mar 12, 2025 • 33
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model Paper • 2503.07703 • Published Mar 10, 2025 • 37
YuE: Scaling Open Foundation Models for Long-Form Music Generation Paper • 2503.08638 • Published Mar 11, 2025 • 71
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice Paper • 2503.05978 • Published Mar 7, 2025 • 36
Effective and Efficient Masked Image Generation Models Paper • 2503.07197 • Published Mar 10, 2025 • 11