CLEAR: Unlocking Generative Potential for Degraded Image Understanding in Unified Multimodal Models Paper • 2604.04780 • Published 5 days ago • 9
TRACE: Task-Adaptive Reasoning and Representation Learning for Universal Multimodal Retrieval Paper • 2603.02929 • Published Mar 4
WISER: Wider Search, Deeper Thinking, and Adaptive Fusion for Training-Free Zero-Shot Composed Image Retrieval Paper • 2602.23029 • Published 18 days ago • 1
ReCALL: Recalibrating Capability Degradation for MLLM-based Composed Image Retrieval Paper • 2602.01639 • Published 11 days ago
PLUME: Latent Reasoning Based Universal Multimodal Embedding Paper • 2604.02073 • Published 9 days ago • 13
COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence Paper • 2512.04563 • Published Dec 4, 2025 • 16
UniFGVC: Universal Training-Free Few-Shot Fine-Grained Vision Classification via Attribute-Aware Multimodal Retrieval Paper • 2508.04136 • Published Aug 6, 2025
Referring Expression Instance Retrieval and A Strong End-to-End Baseline Paper • 2506.18246 • Published Jun 23, 2025