-
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
Paper • 2509.01055 • Published • 79 -
AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
Paper • 2510.04206 • Published • 3 -
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
Paper • 2510.05592 • Published • 107 -
D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use
Paper • 2602.02160 • Published • 13