NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos Paper • 2601.00393 • Published 8 days ago • 106 • 4
Effort: Efficient Orthogonal Modeling for Generalizable AI-Generated Image Detection Paper • 2411.15633 • Published Nov 23, 2024 • 1
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper • 2601.03252 • Published 3 days ago • 88 • 9
Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation Paper • 2512.24271 • Published 10 days ago • 49 • 5
Can We Trust AI Explanations? Evidence of Systematic Underreporting in Chain-of-Thought Reasoning Paper • 2601.00830 • Published 16 days ago • 2 • 3
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits Paper • 2512.20578 • Published 17 days ago • 66 • 4
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization Paper • 2512.24615 • Published 10 days ago • 104 • 4
Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs Paper • 2510.01954 • Published Oct 2, 2025 • 12 • 3
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation Paper • 2601.00664 • Published 7 days ago • 48 • 3
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models Paper • 2512.24165 • Published 10 days ago • 47 • 5
InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion Paper • 2512.17504 • Published 21 days ago • 95 • 4
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space Paper • 2512.24617 • Published 10 days ago • 54 • 4
UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement Paper • 2512.21185 • Published 16 days ago • 28 • 4
Leveraging LLMs for Legacy Code Modernization: Challenges and Opportunities for LLM-Generated Documentation Paper • 2411.14971 • Published Nov 22, 2024 • 1
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss Paper • 2512.23447 • Published 11 days ago • 93 • 4
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper • 2412.19723 • Published Dec 27, 2024 • 87 • 4