MultihopSpatial: Multi-hop Compositional Spatial Reasoning Benchmark for Vision-Language Model
AI & ML interests
Visual Intelligence, Pretrained Vision-and-Language Model, Embodied AI, Collaborative Agents, Vision Task(Object Detection, Segmentation)