-
-
-
-
-
-
Inference Providers
Active filters:
4bit
ModelCloud/Brumby-14B-Base-GPTQMODEL-W4A16
Text Generation
•
15B
•
Updated
•
6
ModelCloud/Brumby-14B-Base-GPTQMODEL-W4A16-v2
Text Generation
•
15B
•
Updated
•
6
CHF0101/medquad-lora-r4-best
Updated
CHF0101/medquad-lora-r4-best-v2
Updated
CHF0101/medquad-lora-r32-best-v2
Updated
sweatSmile/Phi3-Mini-FinSight-FinancialQA
Text Generation
•
4B
•
Updated
•
4
•
1
ikarius/Granite-3.2-8b-instruct-Abliterated-NF4
Text Generation
•
8B
•
Updated
•
1
•
1
ikarius/NeuralDaredevil-8B-abliterated-NF4
Text Generation
•
8B
•
Updated
•
9
•
1
ikarius/Qwen2.5-Coder-14B-Instruct-Abliterated-NF4
Text Generation
•
15B
•
Updated
•
2
•
1
4B
•
Updated
•
11
lunovian/Qwen2.5-Math-7B-Instruct-4bit
2B
•
Updated
•
2
Plurigrid/DR-Tulu-8B-MLX-4bit
1B
•
Updated
•
10
ujjwal52/Llama-2-7b-FLASH-UK
Text Generation
•
7B
•
Updated
•
3
•
1
Plurigrid/Olmo-3-32B-Think-MLX-4bit
Text Generation
•
32B
•
Updated
•
12
beta3/gemma3_1b_title_generator
Sugandha-Chauhan/BioMistral-7B-SymptomDiagnosis
Text Classification
•
Updated
•
8
Text Generation
•
0.6B
•
Updated
•
99
•
3
ikarius/Qwen2.5-Coder-32B-Instruct-Abliterated-NF4
33B
•
Updated
•
4
•
1
smkrv/Qwen3-0.6B-CoreML-4bit
Text Generation
•
Updated
•
17
comptechco/WeThink-Qwen2.5VL-7B-bnb-4bit
Text Generation
•
8B
•
Updated
•
39
codewithdark/Llama-3.2-3B-4bit-mlx
Text Generation
•
3B
•
Updated
•
109
mhmdelbadry1/qwen-reasoning-grpo-4bit
Reinforcement Learning
•
2B
•
Updated
•
33
seochan99/Qwen-Image-Edit-2511-bnb-nf4
Image-to-Image
•
Updated
•
80
bisonnetworking/MediPhi-Instruct-mlx-4bit
Text Generation
•
0.6B
•
Updated
•
80
marksverdhai/vibevoice-7b-bnb-4bit
Text-to-Speech
•
10B
•
Updated
•
15
Abdullah-afify/Elgizawy-EN_AR-translation-style-qwen-3-8b-Q4_K_M
Translation
•
8B
•
Updated
•
67