pythia410m-sft-tldr / code /rl_training_value_model.py

Commit History

Training in progress, step 500
1904ee8
verified

mnoukhov commited on