Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs Paper • 2605.09063 • Published 4 days ago • 62
Running on CPU Upgrade 486 Visualize Dataset (v2.0+ latest dataset format) 💻 486 Explore and visualize LeRobot datasets easily