lm-human-preference-details

Collections 1

Spaces 1

Rlhf Demo

Models 63

lm-human-preference-details/train_policy_accelerate_tf_adam_gpt2_xl_grad_accu__sentiment_offline_5k.json__seed1

Text Generation • Updated Oct 6, 2023 • 12

lm-human-preference-details/train_policy_accelerate_tf_adam_gpt2_xl_grad_accu__sentiment_offline_5k.json__seed5

Text Generation • Updated Oct 6, 2023 • 12

lm-human-preference-details/train_policy_accelerate_tf_adam_gpt2_xl_grad_accu__sentiment_offline_5k.json__seed3

Text Generation • Updated Oct 6, 2023 • 11

lm-human-preference-details/train_policy_accelerate_tf_adam_gpt2_xl_grad_accu__sentiment_offline_5k.json__seed4

Text Generation • Updated Oct 6, 2023 • 13

lm-human-preference-details/train_policy_accelerate_tf_adam_gpt2_xl_grad_accu__sentiment_offline_5k.json__seed2

Text Generation • Updated Oct 6, 2023 • 11

lm-human-preference-details/train_policy_accelerate_pt_adam_gpt2_xl_grad_accu__sentiment_offline_5k.json__seed3

Text Generation • Updated Oct 6, 2023 • 13

lm-human-preference-details/train_policy_accelerate_pt_adam_gpt2_xl_grad_accu__sentiment_offline_5k.json__seed5

Text Generation • Updated Oct 6, 2023 • 14

lm-human-preference-details/train_policy_accelerate_pt_adam_gpt2_xl_grad_accu__sentiment_offline_5k.json__seed2

Text Generation • Updated Oct 6, 2023 • 11

lm-human-preference-details/train_policy_accelerate_pt_adam_gpt2_xl_grad_accu__sentiment_offline_5k.json__seed4

Text Generation • Updated Oct 6, 2023 • 11

lm-human-preference-details/train_policy_accelerate_pt_adam_gpt2__sentiment_offline_5k.json__seed5

Text Generation • Updated Oct 6, 2023 • 11

Datasets

None public yet