arxiv:2410.01428
Fangkai Jiao
chitanda
AI & ML interests
self-supervised pre-training, large language model and machine reasoning.
Organizations
Papers
15
models
69
chitanda/llama2.7b.chat.logiqav2.70b-distil.step.dpo.fix_hack.H100.w4.v1.0.th.s43
Updated
chitanda/llama2.7b.chat.logiqav2.70b-distil.step.dpo.fix_hack.H100.w4.v1.0.th.s42
Updated
chitanda/llama2.7b.chat.logiqav2.70b-distil.step.dpo.fix_hack.A100.w4.v1.0.th.s44
Updated
chitanda/llama2.7b.chat.reclor.gpt35turbo1106.dpo-sft.H100.w4.v2.0
Updated
chitanda/llama2.7b.chat.logiqav2.70b-distil.dpo.fix_hack.H100.w4.v1.0.th.test.s43
Updated
chitanda/llama2.7b.chat.logiqav2.llama-2-70b-chat.dpo-sft.A6K.w4.v1.0
Updated
chitanda/llama2.70b.q_lora.merit_v91_v91.seq2seq.v5.0.6aug.filter.w4.adamw.500steps.NA100.1010
Updated
•
10
chitanda/llama2.7b.chat.reclor.gpt351106.step.dpo.fix_hack.H100.w4.v5.0.s42
Updated
chitanda/llama2.7b.chat.reclor.gpt351106.dpo.fix_hack.H100.w4.v3.0.s42
Updated
chitanda/llama2.7b.chat.logiqav2.70b-distil.step.dpo.fix_hack.H100.w4.v2.10.iter1.s42
Updated