GEMS: Agent-Native Multimodal Generation with Memory and Skills Paper • 2603.28088 • Published 6 days ago • 81
PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning Paper • 2603.26653 • Published 9 days ago • 16