arxiv:2410.04698
Hanze Dong
hendrydong
Papers: 11
Models: 5
hendrydong/dpo_offline_700K
Text Generation
hendrydong/llama3
hendrydong/dpo_K8_max_max
Text Generation
hendrydong/Mistral-RM-for-RAFT-GSHF-v0
Text Classification
hendrydong/Mistral-RM-baseline-No-Safety-Alignment
Text Classification