Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
mayankagarwal
's Collections
RLHF + Code
RLHF + Code
updated
2 days ago
Upvote
-
Vezora/Code-Preference-Pairs
Viewer
•
Updated
Jul 28
•
54k
•
61
•
14
quangduc1112001/python-code-DPO-fine-tune
Viewer
•
Updated
18 days ago
•
2k
•
42
•
2
xinlai/Math-Step-DPO-10K
Viewer
•
Updated
Jul 4
•
10.8k
•
552
•
32
minfeng-ai/leetcode_preference
Viewer
•
Updated
Sep 6, 2023
•
457
•
10
•
6
Magpie-Align/Magpie-Llama-3.1-Pro-DPO-100K-v0.1
Viewer
•
Updated
Aug 22
•
100k
•
173
•
4
openbmb/UltraInteract_pair
Viewer
•
Updated
Apr 5
•
220k
•
691
•
104
NextWealth/Python-DPO-Large
Viewer
•
Updated
Jul 2
•
957
•
66
Upvote
-
Share collection
View history
Collection guide
Browse collections