openai/whisper-large-v3 Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 4.95M • 5.68k
HuggingFaceM4/siglip-so400m-14-980-flash-attn2-navit Zero-Shot Image Classification • 0.9B • Updated Mar 7, 2024 • 91 • 53
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B Text Generation • 31B • Updated Oct 10, 2025 • 99.2k • 811
The Smol Training Playbook • Running on CPU Upgrade • Featured • 3.17k • The secrets to building world-class LLMs
The Ultra-Scale Playbook • Running • 3.84k • The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • 2B • Updated Feb 24, 2025 • 521k • 1.5k
Scaling test-time compute • Running • 596 • Run advanced search strategies to boost LLM problem solving