arxiv:2406.12066
Mingye Gao
mingye94
AI & ML interests
None yet
Recent Activity
updated
a model
about 4 hours ago
mingye94/pku-safeRLHF-baseline-safety-base
updated
a dataset
about 9 hours ago
mingye94/pku-safeRLHF-softlabel
updated
a dataset
about 11 hours ago
mingye94/pku-safeRLHF-softlabel
Organizations
Papers
1
models
10
mingye94/pku-safeRLHF-baseline-safety-base
Updated
mingye94/pku-safeRLHF-baseline-safety-instruct
Updated
mingye94/pku-safeRLHF-softlabel-safety-instruct
Updated
•
4
mingye94/pku-safeRLHF-softlabel-safety-model
Updated
•
837
mingye94/pku-safeRLHF-softlabel-safety-tokenizer
Updated
mingye94/rm_llama3_8B_helpsteer2
Updated
•
12
mingye94/llama3-8B-Instruct-lr_5e-07_bsz_1
Updated
•
3
mingye94/llama3-8B-Instruct-lr_1e-05_bsz_1
Updated
mingye94/llama3-8B-Instruct-lr_1e-5_bsz_2
Updated
•
4
mingye94/meta-llama-Meta-Llama-3-8B-Instruct_lr_1e-05
Updated
datasets
6
mingye94/pku-safeRLHF-softlabel
Viewer
•
Updated
•
82.1k
•
43
mingye94/ultrafeedback_binarized_with_soft_lable_flipped
Viewer
•
Updated
•
63.1k
•
25
mingye94/ultrafeedback_binarized_with_soft_lable
Viewer
•
Updated
•
63.1k
•
56
mingye94/HelpSteer2_pair
Viewer
•
Updated
•
10.2k
•
89
mingye94/HelpSteer2_pair_val
Viewer
•
Updated
•
519
•
86
mingye94/generic_brand_count_pretrained_data
Viewer
•
Updated
•
1
•
43