Active filters: dpo
mradermacher/Mistral-Nemo-Instruct-MCAI-SFT-DPO-revision-only-GGUF • Updated • 208 • 1
mradermacher/Mistral-Nemo-Instruct-MCAI-SFT-DPO-revision-only-i1-GGUF • Updated • 440 • 1
bartowski/Human-Like-Qwen2.5-7B-Instruct-GGUF • Text Generation • Updated • 190 • 2
VAGOsolutions/SauerkrautLM-v2-14b-DPO • Updated • 313 • 18
andito/SmolLM2-1.7B-Instruct-F16-GGUF • Updated • 95 • 1
HuggingFaceTB/SmolVLM-Instruct-DPO • Image-Text-to-Text • Updated • 395 • 16
sapienzanlp/Minerva-7B-instruct-v1.0 • Text Generation • Updated • 2.81k • 14
sapienzanlp/Minerva-7B-instruct-v1.0-GGUF • Text Generation • Updated • 141 • 3
mradermacher/SauerkrautLM-v2-14b-DPO-GGUF • Updated • 101 • 1
mradermacher/SauerkrautLM-v2-14b-DPO-i1-GGUF • Updated • 262 • 1
XueyingJia/Qwen2-1.5B-instruct-dpo
mradermacher/Llama-3-8B-Instruct-DPO-v0.3-GGUF • Updated • 51 • 1
mradermacher/Llama-3-8B-Instruct-DPO-v0.3-i1-GGUF • Updated • 109 • 1
mradermacher/janus-dpo-7b-GGUF • Updated • 157 • 1
mradermacher/janus-dpo-7b-i1-GGUF • Updated • 315 • 1
mradermacher/distilabeled-Marcoro14-7B-slerp-full-GGUF • Updated • 155 • 1
phunguyen01/II-Tulu-8B-DPO-Exp • Text Generation • Updated • 34 • 1
mradermacher/II-Tulu-8B-DPO-GGUF • Updated • 212 • 1
mradermacher/II-Tulu-8B-DPO-i1-GGUF • Updated • 406 • 1
mradermacher/distilabeled-Marcoro14-7B-slerp-full-i1-GGUF • Updated • 237 • 1
mgat1/SmolLM2-FT-DPO • Text Generation • Updated • 10 • 1
mradermacher/Llama-3.1-8B-sft-SPIN-self-GGUF • Updated • 340 • 1
mradermacher/llama-3-8b-DPO-GGUF • Updated • 369 • 1
mradermacher/Llama-3-8B-Instruct-64k-GGUF • Updated • 296 • 1
mradermacher/Llama-3-8B-Instruct-64k-i1-GGUF • Updated • 625 • 1
mradermacher/ContY-v0.2-8B-GGUF • Updated • 376 • 1
AIR-hl/Llama-3.2-3B-DPO • Text Generation • Updated • 79 • 2
li-muyang/zephyr-7b-dpo-full • Text Generation • Updated • 75 • 1
mradermacher/Llama-3.2-3B-DPO-GGUF • Updated • 308 • 1
mradermacher/lambda-qwen2.5-14b-dpo-test-GGUF • Updated • 326 • 1