view article Article **ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?** 11 days ago • 17
view article Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR Oct 23, 2025 • 73
view article Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval +1 Mar 22, 2024 • 128
eugeneyan/semantic-id-qwen3-8b-video-games Text Generation • 8B • Updated Sep 14, 2025 • 39 • 3
HuggingFaceTB/SmolVLM2-500M-Video-Instruct Image-Text-to-Text • Updated Apr 8, 2025 • 196k • 119