pythia410m-sft-tldr / code / configs / dpo1b_relabel_generated_same_prompts.yml

Commit History

Training in progress, step 500
1904ee8
verified

mnoukhov committed on