FantasyVLN: Unified Multimodal Chain-of-Thought Reasoning for Vision-Language Navigation • Paper • 2601.13976 • Published 18 days ago • 21
Florence 2 • Running on Zero • Featured • 826 • Generate detailed captions and analyze images with Florence-2