CANVAS: A Benchmark for Vision-Language Models on Tool-Based User Interface Design • Paper • 2511.20737 • Published Nov 25, 2025
Vision-aligned Latent Reasoning for Multi-modal Large Language Model • Paper • 2602.04476 • Published 7 days ago