WISER: Wider Search, Deeper Thinking, and Adaptive Fusion for Training-Free Zero-Shot Composed Image Retrieval Paper • 2602.23029 • Published 19 days ago • 1
CLEAR: Unlocking Generative Potential for Degraded Image Understanding in Unified Multimodal Models Paper • 2604.04780 • Published 6 days ago • 9
PLUME: Latent Reasoning Based Universal Multimodal Embedding Paper • 2604.02073 • Published 10 days ago • 13
Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing Paper • 2604.02288 • Published 10 days ago • 27
COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence Paper • 2512.04563 • Published Dec 4, 2025 • 16