heegyu's Collections
Safety LM
Updated 10 days ago
meta-llama/LlamaGuard-7b • Text Generation • Updated Apr 17 • 5.37k • 203
meta-llama/Meta-Llama-Guard-2-8B • Text Generation • Updated May 13 • 10.8k • 270
OpenSafetyLab/MD-Judge-v0.1 • Text Generation • Updated May 20 • 1.83k • 13
mcj311/saladbench_data • Viewer • Updated Mar 28 • 30.4k • 5
openbmb/UltraSafety • Viewer • Updated Mar 16 • 3k • 76 • 26
PKU-Alignment/BeaverTails • Viewer • Updated Oct 17, 2023 • 364k • 6.4k • 30
PKU-Alignment/PKU-SafeRLHF • Viewer • Updated 10 days ago • 164k • 14.3k • 107
Anthropic/hh-rlhf • Viewer • Updated May 26, 2023 • 169k • 44.1k • 1.17k
lmsys/toxic-chat • Viewer • Updated May 14 • 20.3k • 5.72k • 130
mmathys/openai-moderation-api-evaluation • Viewer • Updated Aug 28, 2023 • 1.68k • 1.76k • 18
allenai/WildChat-1M • Viewer • Updated 14 days ago • 838k • 916 • 267
allenai/wildjailbreak • Viewer • Updated Aug 8 • 2.21k • 1.45k • 14
allenai/wildguardmix • Viewer • Updated Jun 29 • 88.5k • 3.71k • 12
allenai/xstest-response • Viewer • Updated Jun 29 • 895 • 1.72k • 1
walledai/XSTest • Viewer • Updated Jul 4 • 450 • 861 • 3
meta-llama/Llama-Guard-3-8B • Text Generation • Updated about 1 month ago • 120k • 100