pythia410m-sft-tldr / code /configs /dpo_relabel_summarize_generated_1b_dpo.yml

Commit History

Training in progress, step 500
1904ee8
verified

mnoukhov commited on