Transformers
GGUF
English
alignment-handbook
trl
dpo
Generated from Trainer
Inference Endpoints
conversational
Mistral-Nemo-MCAI-SFT-DPO-GGUF / Mistral-Nemo-MCAI-SFT-DPO.Q4_K_S.gguf

Commit History

uploaded from nethype/db1
b6d5aca
verified

mradermacher committed on