dctanner/sablo-pebble-mistral-dpo-lora-HelpSteer_binarized

alignment-handbook

Generated from Trainer

sablo-pebble-mistral-dpo-lora-HelpSteer_binarized/runs/Jan23_20-43-15_dpo-tests-2-85c5c9854f-p9wdp

1 contributor

History: 10 commits

Model save · 609f4da · verified · 10 months ago

events.out.tfevents.1706043566.dpo-tests-2-85c5c9854f-p9wdp.12217.0 · 77 kB · LFS · Model save · 10 months ago
events.out.tfevents.1706056801.dpo-tests-2-85c5c9854f-p9wdp.12217.1 · 828 Bytes · LFS · Model save · 10 months ago