Coevolving Representations in Joint Image-Feature Diffusion Paper • 2604.17492 • Published 9 days ago • 4
EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model Paper • 2604.10268 • Published 17 days ago • 11
StyleID: A Perception-Aware Dataset and Metric for Stylization-Agnostic Facial Identity Recognition Paper • 2604.21689 • Published 5 days ago • 24
WorldMark: A Unified Benchmark Suite for Interactive Video World Models Paper • 2604.21686 • Published 5 days ago • 36
LLaTiSA: Towards Difficulty-Stratified Time Series Reasoning from Visual Perception to Semantics Paper • 2604.17295 • Published 9 days ago • 84
Exploring Spatial Intelligence from a Generative Perspective Paper • 2604.20570 • Published 6 days ago • 21
A Self-Evolving Framework for Efficient Terminal Agents via Observational Context Compression Paper • 2604.19572 • Published 7 days ago • 21
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 6 days ago • 233
Elucidating the SNR-t Bias of Diffusion Probabilistic Models Paper • 2604.16044 • Published 11 days ago • 73
GlobalSplat: Efficient Feed-Forward 3D Gaussian Splatting via Global Scene Tokens Paper • 2604.15284 • Published 12 days ago • 24
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper • 2604.14268 • Published 13 days ago • 115
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published 14 days ago • 87
Strips as Tokens: Artist Mesh Generation with Native UV Segmentation Paper • 2604.09132 • Published 18 days ago • 53
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published Mar 27 • 362
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 26 days ago • 495
RefineAnything: Multimodal Region-Specific Refinement for Perfect Local Details Paper • 2604.06870 • Published 20 days ago • 41
Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory Paper • 2604.08995 • Published 18 days ago • 48
FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios Paper • 2604.07413 • Published 20 days ago • 95