Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
heegyu
's Collections
Korean Reward Modeling
Korean Pretraining Dataset
RLHF papers
Reward Modeling Datasets
Pre-training Dataset
Vision LM
Image Generation
Domain Specific (Math, Code, etc)
Machine Translation
Safety LM
Text2SQL
Safety LM
updated
Sep 10
Upvote
-
meta-llama/LlamaGuard-7b
Text Generation
•
Updated
Apr 17
•
5.32k
•
212
meta-llama/Meta-Llama-Guard-2-8B
Text Generation
•
Updated
May 13
•
10.6k
•
281
OpenSafetyLab/MD-Judge-v0.1
Text Generation
•
Updated
May 20
•
301
•
13
mcj311/saladbench_data
Viewer
•
Updated
Mar 28
•
30.4k
•
44
openbmb/UltraSafety
Viewer
•
Updated
Mar 16
•
3k
•
109
•
27
PKU-Alignment/BeaverTails
Viewer
•
Updated
Oct 17, 2023
•
364k
•
2.91k
•
32
PKU-Alignment/PKU-SafeRLHF
Viewer
•
Updated
18 days ago
•
164k
•
3.3k
•
114
Anthropic/hh-rlhf
Viewer
•
Updated
May 26, 2023
•
169k
•
8.44k
•
1.19k
lmsys/toxic-chat
Viewer
•
Updated
May 14
•
20.3k
•
3.62k
•
135
mmathys/openai-moderation-api-evaluation
Viewer
•
Updated
Aug 28, 2023
•
1.68k
•
208
•
18
allenai/WildChat-1M
Viewer
•
Updated
19 days ago
•
838k
•
1.45k
•
279
allenai/wildjailbreak
Viewer
•
Updated
Aug 8
•
2.21k
•
1.25k
•
23
allenai/wildguardmix
Viewer
•
Updated
Jun 29
•
88.5k
•
2.51k
•
12
allenai/xstest-response
Viewer
•
Updated
Jun 29
•
895
•
459
•
2
walledai/XSTest
Viewer
•
Updated
Jul 4
•
450
•
1.21k
•
3
meta-llama/Llama-Guard-3-8B
Text Generation
•
Updated
25 days ago
•
91.7k
•
120
Upvote
-
Share collection
View history
Collection guide
Browse collections