Base Model
mistralai/Mistral-Small-3.1-24B-Base-2503
Updated • 7.11k
• 272
Text Generation
• Updated • 252k
• 164
Text Generation
• 22B • Updated • 6.51M
• 4.57k
Text Generation
• 685B • Updated • 3.9M
• 13.3k
Text Generation
• Updated • 10.6k
• 301
baidu/ERNIE-4.5-0.3B-Base-PT
Text Generation
• Updated • 2.92k
• 22
Text Generation
• 1B • Updated • 1.34M
• 2.38k
Updated • 27.6k
• 1.07k
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation
• 2B • Updated • 503k
• 1.49k
baidu/ERNIE-4.5-VL-28B-A3B-Thinking
Image-Text-to-Text
• 30B • Updated • 2.02k
• 536
deepseek-ai/DeepSeek-R1-Zero
Text Generation
• Updated • 5.74k
• 956
Text Generation
• 9B • Updated • 20.8k
• 104
Text Generation
• 0.4B • Updated • 25.7k
• 249
Text Generation
• Updated • 374
• 41
microsoft/Phi-4-mini-flash-reasoning
Text Generation
• Updated • 919
• 275
Qwen/Qwen3-VL-2B-Instruct
Image-Text-to-Text
• 2B • Updated • 166M
• 377
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation
• Updated • 201k
• 992
tencent/Hunyuan-0.5B-Pretrain
Text Generation
• 0.5B • Updated • 2.82k
• 11
Text Generation
• 7B • Updated • 121k
• 66