GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published 18 days ago • 363
PhoenixHu/ral_grpo_internvl2_5_how2sign_1b_bleu1_rouge_kl05_temp07_0405_metta Updated 12 days ago • 1
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 19 days ago • 481
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published 22 days ago • 340
Density-aware Soft Context Compression with Semi-Dynamic Compression Ratio Paper • 2603.25926 • Published 25 days ago • 8
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 339
AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI Paper • 2603.22327 • Published Mar 20 • 10
Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding Paper • 2603.19235 • Published Mar 19 • 95
InCoder-32B: Code Foundation Model for Industrial Scenarios Paper • 2603.16790 • Published Mar 17 • 308
Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning Paper • 2603.04597 • Published Mar 4 • 210
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 263