heegyu's Collections
Safety LM
Updated Sep 10
meta-llama/LlamaGuard-7b • Text Generation • Updated Apr 17 • 9.18k • 213
meta-llama/Meta-Llama-Guard-2-8B • Text Generation • Updated May 13 • 11.7k • 281
OpenSafetyLab/MD-Judge-v0.1 • Text Generation • Updated May 20 • 353 • 13
mcj311/saladbench_data • Viewer • Updated Mar 28 • 30.4k • 56
openbmb/UltraSafety • Viewer • Updated Mar 16 • 3k • 113 • 27
PKU-Alignment/BeaverTails • Viewer • Updated Oct 17, 2023 • 364k • 2.88k • 32
PKU-Alignment/PKU-SafeRLHF • Viewer • Updated 23 days ago • 164k • 3.55k • 114
Anthropic/hh-rlhf • Viewer • Updated May 26, 2023 • 169k • 8.49k • 1.2k
lmsys/toxic-chat • Viewer • Updated May 14 • 20.3k • 3.45k • 135
mmathys/openai-moderation-api-evaluation • Viewer • Updated Aug 28, 2023 • 1.68k • 234 • 18
allenai/WildChat-1M • Viewer • Updated 23 days ago • 838k • 1.56k • 280
allenai/wildjailbreak • Viewer • Updated Aug 8 • 2.21k • 1.23k • 23
allenai/wildguardmix • Viewer • Updated Jun 29 • 88.5k • 2.73k • 12
allenai/xstest-response • Viewer • Updated Jun 29 • 895 • 460 • 2
walledai/XSTest • Viewer • Updated Jul 4 • 450 • 1.11k • 3
meta-llama/Llama-Guard-3-8B • Text Generation • Updated 30 days ago • 94k • 121