SongTonyLi/Phi-3.5-mini-instruct-SFT-D_chosen-dpo-mix-shuffled Text Generation • Updated 2 days ago • 66
SongTonyLi/Phi-3.5-mini-instruct-SFT-D_chosen-dpo-mix_skywork_infinity Text Generation • Updated 2 days ago • 21
SongTonyLi/Phi-3.5-mini-instruct-SFT-D_chosen-dpo-mix-shuffled3 Text Generation • Updated 2 days ago • 18
SongTonyLi/Phi-3.5-mini-instruct-SFT-D_chosen-dpo-mix-shuffled4 Text Generation • Updated 1 day ago • 24
regent-project/jat-regent-medium-embeddings-checkpoint-27726 Reinforcement Learning • Updated 2 days ago • 2
simpleParadox/seed_1_git_causal_image_caption_only_curriculum_final_model_8_epochs Updated 1 day ago • 15
simpleParadox/seed_2_git_causal_image_caption_only_curriculum_final_model_8_epochs Updated 1 day ago • 13
simpleParadox/seed_1_git_causal_image_caption_only_standard_final_model_8_epochs Updated 1 day ago • 15
simpleParadox/seed_2_git_causal_image_caption_only_standard_final_model_8_epochs Updated 1 day ago • 15
simpleParadox/seed_1_git_causal_initialize_with_text_image_caption_only_curriculum_final_model_8_epochs Updated 1 day ago • 14
simpleParadox/seed_2_git_causal_initialize_with_text_image_caption_only_curriculum_final_model_8_epochs Updated 1 day ago • 15
simpleParadox/seed_1_git_causal_initialize_with_text_image_caption_only_standard_final_model_8_epochs Updated 1 day ago • 13
simpleParadox/seed_2_git_causal_initialize_with_text_image_caption_only_standard_final_model_8_epochs Updated 1 day ago • 15