ULTRA: Unified Multimodal Control for Autonomous Humanoid Whole-Body Loco-Manipulation Paper • 2603.03279 • Published Mar 3 • 1
HandX: Scaling Bimanual Motion and Interaction Generation Paper • 2603.28766 • Published 25 days ago • 12
VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions Paper • 2603.23495 • Published Mar 24 • 3
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models Paper • 2311.16933 • Published Nov 28, 2023 • 1
Automated Conversion of Music Videos into Lyric Videos Paper • 2308.14922 • Published Aug 28, 2023
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion Paper • 2502.08590 • Published Feb 12, 2025 • 42
Light of Normals: Unified Feature Representation for Universal Photometric Stereo Paper • 2506.18882 • Published Jun 23, 2025 • 89
Mindalogue: LLM-Powered Nonlinear Interaction for Effective Learning and Task Exploration Paper • 2410.10570 • Published Oct 14, 2024
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers Paper • 2305.17455 • Published May 27, 2023
Generative AI for Film Creation: A Survey of Recent Advances Paper • 2504.08296 • Published Apr 11, 2025
ProFashion: Prototype-guided Fashion Video Generation with Multiple Reference Images Paper • 2505.06537 • Published May 10, 2025
Taming Flow-based I2V Models for Creative Video Editing Paper • 2509.21917 • Published Sep 26, 2025
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models Paper • 2603.17051 • Published Mar 17 • 109
ShotVerse: Advancing Cinematic Camera Control for Text-Driven Multi-Shot Video Creation Paper • 2603.11421 • Published Mar 12 • 34
LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory Paper • 2603.03269 • Published Mar 3 • 63
UniT: Unified Multimodal Chain-of-Thought Test-time Scaling Paper • 2602.12279 • Published Feb 12 • 20
CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards Paper • 2510.08529 • Published Oct 9, 2025 • 19