TreeCUA: Efficiently Scaling GUI Automation with Tree-Structured Verifiable Evolution • Paper • 2602.09662 • Published 2 days ago • 6
VinciCoder: Unifying Multimodal Code Generation via Coarse-to-fine Visual Reinforcement Learning • Paper • 2511.00391 • Published Nov 1, 2025 • 1
OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models • Paper • 2601.21639 • Published 14 days ago • 49