RLHF dataset

#2
by jaredjoss - opened

Hi!

I understand that you used a subset of 15k examples of the Anthropic/hh-rlhf dataset to fine-tune this model.
How did you make this split and is this dataset available anywhere?

Thank you!

jaredjoss changed discussion status to closed
jaredjoss changed discussion status to open

Sign up or log in to comment