RLHF dataset

by jaredjoss - opened Jul 25

Hi!

I understand that you fine-tuned this model on a 15k-example subset of the Anthropic/hh-rlhf dataset.
How did you create this subset, and is it available anywhere?

Thank you!

jaredjoss changed discussion status to closed Aug 8

jaredjoss changed discussion status to open Aug 8
