Granite Embedding Collection Embedding models (bi‑encoders and rerankers) for RAG, semantic search, and retrieval tasks. • 9 items • Updated 5 days ago • 42
Granite 4.1 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 6 items • Updated 6 days ago • 45
Llama 3.2 Collection This collection hosts the transformers and original repos of Llama 3.2 and Llama Guard 3. • 15 items • Updated Dec 6, 2024 • 670
Mistral Small 4 Collection A state-of-the-art open-weight model with a granular Mixture-of-Experts architecture that fuses instruct, reasoning, and agentic skills. • 3 items • Updated Mar 16 • 72
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 25 items • Updated Mar 2 • 580
Devstral 2 Collection Collection for Devstral-Small-2-24B-Instruct-2512 models • 2 items • Updated Dec 19, 2025 • 1
Multimodal GGUFs Collection Vision and audio models compatible with llama-server and llama-mtmd-cli • 16 items • Updated Dec 18, 2025 • 20
Embedding Models Collection Run or fine-tune embedding models with Unsloth. • 14 items • Updated 13 days ago • 6
Unsloth Diffusion GGUFs Collection Find GGUFs and other variants of diffusion based models like Qwen-Image and FLUX. • 20 items • Updated 13 days ago • 82
Gemma 4 Collection Gemma 4 is Google's new model family, including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated 13 days ago • 171
Jan v1 Collection Jan-v1 is the first release in the Jan Family, designed for agentic reasoning and problem-solving within the Jan App • 6 items • Updated Sep 10, 2025 • 8