Active filters: dpo
sweepai/sweep-next-edit-v2-7B
Text Generation
• 8B • Updated • 377
• 22
F16/z-image-turbo-flow-dpo
Feature Extraction
• Updated • 173
danielcherubini/Qwen3.5-DeltaCoder-9B-GGUF
Text Generation
• 9B • Updated • 4.26k
• 14
BugTraceAI/BugTraceAI-Apex-G4-26B-Q4
25B • Updated • 17.7k
• 57
mlabonne/NeuralDaredevil-8B-abliterated
Text Generation
• 8B • Updated • 16.2k
• 269
mradermacher/G-Health-14B-Base-i1-GGUF
15B • Updated • 150
• 2
ToastyPigeon/Qwen3.5-27B-Heretic-Marvin-V1
27B • Updated • 15
• 2
Reinforcement Learning
• 3B • Updated • 13
• 1
Text Generation
• 10B • Updated • 49
• 1
ArmadaOS/AOS-Chief-of-Staff-v1.0
Text Generation
• 10B • Updated • 53
• 1
VladShash/deepseek-math-7b-lean-prover-dpo-olmo-3
Text Generation
• 7B • Updated • 3.18k
• 4
ReXeeD/Luminus-1.5B-Roleplay
Text Generation
• 2B • Updated • 651
• 1
ReXeeD/Luminus-1.5B-Roleplay-GGUF
Text Generation
• 2B • Updated • 1.9k
• 1
mradermacher/dotnet-coder-14b-GGUF
15B • Updated • 457
• 1
apol/alia-40b-distill-vapol
Text Generation
• 40B • Updated • 817
• 1
HCY123902/llama-3-8b-inst-dpo-on-p-twj-beta-1e-0
Text Generation
• 266k • Updated • 19
• 1
Olak17/Qwen2.5-Coder-1.5B-Unsensored-DPO-i1-GGUF
2B • Updated • 3.42k
• 2
zipaltrivedi/dotnet-coder-14b
Text Generation
• 15B • Updated • 3.98k
• 5
F16/z-image-turbo-masked-dpo
Text-to-Image
• Updated • 18
lyogavin/Anima33B-DPO-Belle-1k
Text Generation
• Updated • 1
lyogavin/Anima33B-DPO-Belle-1k-merged
Text Generation
• Updated • 9
• 12
daekeun-ml/Llama-2-ko-DPO-13B
Text Generation
• 13B • Updated • 6
• 19
lewtun/zephyr-7b-dpo-full
Text Generation
• 7B • Updated • 6
alignment-handbook/zephyr-7b-dpo-full
Text Generation
• 7B • Updated • 17
• 3
alignment-handbook/zephyr-7b-dpo-qlora
Updated • 19
• 9
Text Generation
• Updated • 12
• 7
argilla/notus-7b-v1-lora-adapter
Text Generation
• Updated • 3
Text Generation
• 7B • Updated • 98
• 123
ContextualAI/archangel_sft_pythia1-4b
Text Generation
• 1B • Updated • 8
ContextualAI/archangel_sft_pythia2-8b
Text Generation
• 3B • Updated • 12
• 1