Automatic Curriculum Expert Iteration for Reliable LLM Reasoning Paper • 2410.07627 • Published Oct 10, 2024
Large Language Models as Commonsense Knowledge for Large-Scale Task Planning Paper • 2305.14078 • Published May 23, 2023
MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers Paper • 2508.14704 • Published Aug 20, 2025 • 43
On the Empirical Complexity of Reasoning and Planning in LLMs Paper • 2404.11041 • Published Apr 17, 2024 • 1
GPA: Learning GUI Process Automation from Demonstrations Paper • 2604.01676 • Published 2 days ago • 7