A fine-grained visual reasoning benchmark (We show more question types in the extension dataset.)
Sicheng Feng
FSCCS
AI & ML interests
None yet
Recent Activity
Updated a dataset 5 days ago: FSCCS/ReasonMap
Upvoted a paper 10 days ago: OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding