Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning Paper • 2510.19807 • Published Oct 22, 2025
SearchGym: Bootstrapping Real-World Search Agents via Cost-Effective and High-Fidelity Environment Simulation Paper • 2601.14615 • Published Jan 21
VisionDirector: Vision-Language Guided Closed-Loop Refinement for Generative Image Synthesis Paper • 2512.19243 • Published Dec 22, 2025
VisionDirector: Vision-Language Guided Closed-Loop Refinement for Generative Image Synthesis Paper • 2512.19243 • Published Dec 22, 2025
MedAgentBoard: Benchmarking Multi-Agent Collaboration with Conventional Methods for Diverse Medical Tasks Paper • 2505.12371 • Published May 18, 2025
SmartSwitch: Advancing LLM Reasoning by Overcoming Underthinking via Promoting Deeper Thought Exploration Paper • 2510.19767 • Published Oct 22, 2025