deepseek-ai/DeepSeek-V3.1-Terminus Text Generation • 685B • Updated Sep 29, 2025 • 10k • 359
Qwen/Qwen3-Next-80B-A3B-Instruct Text Generation • 81B • Updated Sep 17, 2025 • 2.01M • 909
Running • 216 • FineVision: Open Data is All You Need • A new open-source dataset for training VLMs
google/embeddinggemma-300m Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 674k • 1.42k
Paused • Featured • 803 • Qwen Image Edit • Edit and enhance images based on descriptive instructions
ngxson/Home-Cook-Mistral-Small-Omni-24B-2507-GGUF Any-to-Any • 24B • Updated Jul 28, 2025 • 320 • 27
Running • 3.66k • The Ultra-Scale Playbook • The ultimate guide to training LLM on large GPU Clusters