pythia410m-sft-tldr / code /configs /dpo_eval_costa_1b_fp16.yml

Commit History

Training in progress, step 500
1904ee8
verified

mnoukhov commited on