DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries Paper • 2404.00086 • Published Mar 29, 2024
Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs Paper • 2501.04670 • Published Jan 8, 2025
DVIS++: Improved Decoupled Framework for Universal Video Segmentation Paper • 2312.13305 • Published Dec 20, 2023
DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World Paper • 2506.24102 • Published Jun 30, 2025
The 1st Solution for 7th LSVOS RVOS Track: SaSaSa2VA Paper • 2509.16972 • Published Sep 21, 2025 • 2
Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation Paper • 2507.05948 • Published Jul 8, 2025 • 1
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs Paper • 2510.18876 • Published Oct 21, 2025 • 37
LSVOS 2025 Challenge Report: Recent Advances in Complex Video Object Segmentation Paper • 2510.11063 • Published Oct 13, 2025 • 1