Models (388)

Active filters: 4bit

ModelCloud/Brumby-14B-Base-GPTQMODEL-W4A16

Text Generation • 15B • Updated Oct 31, 2025 • 6

ModelCloud/Brumby-14B-Base-GPTQMODEL-W4A16-v2

Text Generation • 15B • Updated Oct 31, 2025 • 6

CHF0101/medquad-lora-r4-best

Updated Nov 2, 2025

CHF0101/medquad-lora-r4-best-v2

Updated Nov 2, 2025

CHF0101/medquad-lora-r32-best-v2

Updated Nov 2, 2025

sweatSmile/Phi3-Mini-FinSight-FinancialQA

Text Generation • 4B • Updated Nov 2, 2025 • 4 • 1

ikarius/Granite-3.2-8b-instruct-Abliterated-NF4

Text Generation • 8B • Updated Nov 17, 2025 • 1 • 1

ikarius/NeuralDaredevil-8B-abliterated-NF4

Text Generation • 8B • Updated 22 days ago • 9 • 1

ikarius/Qwen2.5-Coder-14B-Instruct-Abliterated-NF4

Text Generation • 15B • Updated Nov 18, 2025 • 2 • 1

Infiniaai/teddy-3.5b

4B • Updated Nov 17, 2025 • 11

wcosmas/sbcc-qwen

Updated Nov 26, 2025

lunovian/Qwen2.5-Math-7B-Instruct-4bit

2B • Updated Nov 21, 2025 • 2

Plurigrid/DR-Tulu-8B-MLX-4bit

1B • Updated Nov 22, 2025 • 10

ujjwal52/Llama-2-7b-FLASH-UK

Text Generation • 7B • Updated Nov 22, 2025 • 3 • 1

Plurigrid/Olmo-3-32B-Think-MLX-4bit

Text Generation • 32B • Updated Nov 22, 2025 • 12

gawadx1/Krvn

Updated Nov 24, 2025 • 1

beta3/gemma3_1b_title_generator

Updated Nov 24, 2025 • 1

Sugandha-Chauhan/BioMistral-7B-SymptomDiagnosis

Text Classification • Updated Nov 29, 2025 • 8

hunterbown/dante-qwen-4b

Text Generation • 0.6B • Updated Dec 4, 2025 • 99 • 3

ikarius/Qwen2.5-Coder-32B-Instruct-Abliterated-NF4

33B • Updated Dec 1, 2025 • 4 • 1

smkrv/Qwen3-0.6B-CoreML-4bit

Text Generation • Updated Dec 3, 2025 • 17

comptechco/WeThink-Qwen2.5VL-7B-bnb-4bit

Text Generation • 8B • Updated 14 days ago • 39

codewithdark/Llama-3.2-3B-4bit-mlx

Text Generation • 3B • Updated 18 days ago • 109

mhmdelbadry1/qwen-reasoning-grpo-4bit

Reinforcement Learning • 2B • Updated 10 days ago • 33

seochan99/Qwen-Image-Edit-2511-bnb-nf4

Image-to-Image • Updated 10 days ago • 80

bisonnetworking/MediPhi-Instruct-mlx-4bit

Text Generation • 0.6B • Updated 7 days ago • 80

marksverdhai/vibevoice-7b-bnb-4bit

Text-to-Speech • 10B • Updated 6 days ago • 15

Abdullah-afify/Elgizawy-EN_AR-translation-style-qwen-3-8b-Q4_K_M

Translation • 8B • Updated 4 days ago • 67