heegyu's Collections
Reward Modeling Datasets
Updated Aug 4
Dataset • Updated • Rows • Downloads • Likes
nvidia/HelpSteer • Jun 24 • 37.1k • 2.7k • 217
Anthropic/hh-rlhf • May 26, 2023 • 169k • 8.44k • 1.19k
stanfordnlp/SHP • Oct 10, 2023 • 386k • 1.77k • 293
PKU-Alignment/PKU-SafeRLHF • 18 days ago • 164k • 3.3k • 114
openai/webgpt_comparisons • Dec 19, 2022 • 19.6k • 351 • 224
openai/summarize_from_feedback • Jan 3, 2023 • 194k • 1.17k • 188
HuggingFaceH4/ultrafeedback_binarized • 20 days ago • 187k • 6.16k • 234
berkeley-nest/Nectar • Mar 20 • 183k • 575 • 276
HuggingFaceH4/stack-exchange-preferences • Mar 8, 2023 • 10.8M • 1.5k • 121
HuggingFaceH4/hhh_alignment • Mar 2, 2023 • 221 • 223 • 16
Birchlabs/openai-prm800k-stepwise-critic • Jun 3, 2023 • 1.09M • 531 • 43
prometheus-eval/Feedback-Collection • Oct 14, 2023 • 100k • 604 • 106
argilla/OpenHermesPreferences • Mar 1 • 989k • 4.85k • 198
allenai/reward-bench • Sep 9 • 8.11k • 7.1k • 73
nvidia/HelpSteer2 • 21 days ago • 21.4k • 18.6k • 359
Magpie-Align/Magpie-Pro-DPO-200K • Aug 20 • 207k • 41 • 5
argilla/magpie-ultra-v0.1 • 27 days ago • 50k • 542 • 210