ibm-granite/granite-docling-258M Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 214k • 1.08k
Qwen/Qwen3-235B-A22B-Instruct-2507 Text Generation • 235B • Updated Sep 17, 2025 • 101k • • 742
google/timesfm-2.0-500m-pytorch Time Series Forecasting • 0.5B • Updated Apr 16, 2025 • 7.84k • 231
twinkle-ai/Llama-3.2-3B-F1-Reasoning-Instruct Text Generation • 4B • Updated Sep 9, 2025 • 6 • 46
Running 3.63k The Ultra-Scale Playbook 🌌 3.63k The ultimate guide to training LLM on large GPU Clusters