Puristan_Spaces Runtime error Agents 3 Indic ParlerTTS Urdu 🦀 3 IndicParler_TTS for Urdu_Punjabi & Sindhi
MultiModal Models Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023 • 96
Pspets multimodal GPT-4V(ision) is a Generalist Web Agent, if Grounded Paper • 2401.01614 • Published Jan 3, 2024 • 23
Document Understanding Models vidore/colpali Visual Document Retrieval • Updated Nov 24, 2025 • 2.18k • 477