ClawBench: Can AI Agents Complete Everyday Online Tasks? Paper • 2604.08523 • Published 2 days ago • 81
QiMeng-PRepair: Precise Code Repair via Edit-Aware Reward Optimization Paper • 2604.05963 • Published 4 days ago • 5
Towards Analyzing and Mitigating Sycophancy in Large Vision-Language Models Paper • 2408.11261 • Published Aug 21, 2024
QiMeng-PRepair Collection QiMeng-PRepair: Precise Code Repair via Edit-Aware Reward Optimization • 4 items • Updated 3 days ago
QiMeng-MuPa Collection QiMeng-MuPa: Mutual-Supervised Learning for Sequential-to-Parallel Code Translation • 5 items • Updated 3 days ago