Transformers
GGUF
English
alignment-handbook
trl
dpo
Generated from Trainer
Inference Endpoints
conversational
Mistral-Nemo-MCAI-SFT-DPO-GGUF / Mistral-Nemo-MCAI-SFT-DPO.Q5_K_M.gguf

Commit History

uploaded from nethype/db1
0fe9285
verified

mradermacher commited on