What is the fine-tuning technique used?

#18
by niravlg - opened

The README does not provide any details on the exact fine-tuning recipe. Could you elaborate on whether you used LoRA or full fine-tuning, and whether any alignment techniques (e.g., DPO) were applied?

NousResearch org

Full fine-tuning and DPO.

teknium changed discussion status to closed
