MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons Paper • 2604.28130 • Published 5 days ago • 16
Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence Paper • 2604.24954 • Published 8 days ago • 17
Map2World: Segment Map Conditioned Text to 3D World Generation Paper • 2605.00781 • Published 4 days ago • 13
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation Paper • 2604.24764 • Published 8 days ago • 115
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Paper • 2604.26752 • Published 6 days ago • 92
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published 26 days ago • 245
Seedance 2.0: Advancing Video Generation for World Complexity Paper • 2604.14148 • Published 20 days ago • 155
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published 27 days ago • 187
Boxer: Robust Lifting of Open-World 2D Bounding Boxes to 3D Paper • 2604.05212 • Published 29 days ago • 2
GaussianGPT: Towards Autoregressive 3D Gaussian Scene Generation Paper • 2603.26661 • Published Mar 27 • 26
InCoder-32B-Thinking: Industrial Code World Model for Thinking Paper • 2604.03144 • Published Apr 3 • 233
TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate Paper • 2504.19874 • Published Apr 28, 2025 • 34
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Paper • 2603.19220 • Published Mar 19 • 66
Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning Paper • 2603.04597 • Published Mar 4 • 210