Transformers
GGUF
English
alignment-handbook
trl
dpo
Generated from Trainer
Inference Endpoints
conversational
Mistral-Nemo-MCAI-SFT-DPO-GGUF / Mistral-Nemo-MCAI-SFT-DPO.IQ4_XS.gguf

Commit History

uploaded from nethype/db1
f9abaad
verified

mradermacher committed on