DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries Paper • 2404.00086 • Published Mar 29, 2024
Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs Paper • 2501.04670 • Published Jan 8, 2025
DVIS++: Improved Decoupled Framework for Universal Video Segmentation Paper • 2312.13305 • Published Dec 20, 2023
DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World Paper • 2506.24102 • Published Jun 30, 2025
The 1st Solution for 7th LSVOS RVOS Track: SaSaSa2VA Paper • 2509.16972 • Published Sep 21, 2025 • 2
Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation Paper • 2507.05948 • Published Jul 8, 2025 • 1
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs Paper • 2510.18876 • Published Oct 21, 2025 • 37
LSVOS 2025 Challenge Report: Recent Advances in Complex Video Object Segmentation Paper • 2510.11063 • Published Oct 13, 2025 • 1