Submitted by CoreloneH 83 Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation FrameX-AI 12 1
Submitted by LMD0311 48 HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation H-EmbodVis 35 1
Submitted by yhyang-myron 15 PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World · 10 authors 13 2
Submitted by DyJiang 13 D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models Tongyi-MAI 4 1
Submitted by yilunzhao 12 Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems Yale University 4 1
Submitted by csfufu 8 OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents Tencent Hunyuan 4 1
Submitted by taesiri 4 Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation · 19 authors 2.1k
Submitted by dorienh 2 APEX: Large-scale Multi-task Aesthetic-Informed Popularity Prediction for AI-Generated Music AMAAI Lab 5 1
Submitted by EdBianchi 1 Parameter-Efficient Multi-View Proficiency Estimation: From Discriminative Classification to Generative Feedback · 2 authors 1
Submitted by huimeiwang-1993 1 MedSkillAudit: A Domain-Specific Audit Framework for Medical Research Agent Skills AIPOCH 515 1