What is the fine-tuning technique used?

#18

by niravlg - opened Jun 10

Jun 10

The README does not provide any deatils on the exact fine-tuning recipe. Could you elaborate if you used LoRA/full fine-tuning or any alignment techniques (DPO).

teknium

NousResearch org Jul 11

full fine tuning and DPO

teknium changed discussion status to closed Jul 11

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment