iamtarun/python_code_instructions_18k_alpaca Viewer • Updated Jul 27, 2023 • 18.6k • 1.77k • 235
Malikeh1375/medical-question-answering-datasets Viewer • Updated Nov 2, 2023 • 1.26M • 721 • 27
llm-wizard/dolly-15k-instruction-alpaca-format Viewer • Updated Apr 13, 2023 • 15k • 129 • 30
Telugu-LLM-Labs/marathi_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 42 • 1
Telugu-LLM-Labs/nepali_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 45 • 5
generative-technologies/synth-ehr-icd10-alpaca-format Viewer • Updated Jun 24 • 379k • 155 • 1
Vanessasml/cybersecurity_32k_instruction_input_output Viewer • Updated Apr 19 • 32.6k • 100 • 12
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0__OCR-C25-L25-E25-R05 Viewer • Updated Nov 29, 2023 • 10.1M • 70
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 23 • 568k • 102
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_2 Viewer • Updated Mar 22 • 568k • 108
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_3 Viewer • Updated Mar 25 • 40k • 36
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_3_16 Viewer • Updated Mar 26 • 20k • 35
akbargherbal/six_millions_instruction_dataset_for_arabic_llm_ft Viewer • Updated May 20 • 6.37M • 66 • 1
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_xlarge__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 1M • 39
Mitsuki-Sakamoto/alpaca_farm-alpaca_instructions_gen_eval_sft Viewer • Updated Mar 7 • 1.2k • 76
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 22 • 568k • 84
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.3_seed_1 Viewer • Updated Mar 25 • 189k • 49
y1xing/natural_language_prompt_dataset_evaluation_instruct_dataset Viewer • Updated Jul 14 • 276 • 34
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_2_16 Viewer • Updated Mar 26 • 20k • 36
phyloforfun/HLT_MICH_Angiospermae_SLTPvC_v1-0_medium_OCR-C25-L25-E50-R05 Viewer • Updated Mar 15 • 10k • 34 • 1
somosnlp-hackathon-2023/ask2democracy-cfqa-salud-pension Viewer • Updated Apr 11, 2023 • 3.81k • 67 • 3
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 25 • 189k • 42
Mitsuki-Sakamoto/alfa-deberta-re-pref-64-fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 26 • 94.6k • 44
y1xing/orpo_llama3_concatenated_data_with_chris_examples_orpo_instruct_dataset Viewer • Updated Jul 6 • 2.64k • 37
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.1_seed_2 Viewer • Updated Mar 23 • 568k • 122
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.1_seed_1 Viewer • Updated Mar 25 • 189k • 65
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 24 • 568k • 247
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 25 • 189k • 51
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_1.0_seed_3 Viewer • Updated Mar 25 • 189k • 55
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2 Viewer • Updated Mar 7 • 60k • 44
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2_random Viewer • Updated Mar 10 • 60k • 55
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-8_random Viewer • Updated Mar 10 • 60k • 39
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 21 • 568k • 403
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_1.0_seed_2 Viewer • Updated Mar 22 • 568k • 151
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 21 • 568k • 203
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.1_seed_3 Viewer • Updated Mar 22 • 568k • 281
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3 Viewer • Updated Mar 23 • 568k • 76
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 24 • 189k • 49
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 24 • 511k • 147
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 24 • 189k • 53
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.1_seed_2 Viewer • Updated Mar 25 • 189k • 62
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.1_seed_3 Viewer • Updated Mar 25 • 189k • 42
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_14m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 113
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.5 Viewer • Updated Mar 27 • 568k • 84
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.3_seed_2 Viewer • Updated Mar 21 • 568k • 128
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 25 • 189k • 53
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 25 • 189k • 81
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 25 • 189k • 42
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.3_seed_3 Viewer • Updated Mar 25 • 189k • 41
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 75
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 73
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 100
Mitsuki-Sakamoto/alfa-deberta-re-pref-64-fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 26 • 94.6k • 49
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.5 Viewer • Updated Mar 26 • 568k • 92
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.25 Viewer • Updated Mar 26 • 568k • 84
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.7 Viewer • Updated Mar 27 • 568k • 75
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_4 Viewer • Updated Apr 26 • 303k • 60
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_4 Viewer • Updated Apr 26 • 303k • 100
vinhtran2611/ArtifactAI_arxiv-physics-instruct-tune-30k_formated Viewer • Updated Jun 7 • 30.2k • 36
vinhtran2611/arxiv-physics-instruct-tune-30k_filtered_formated Viewer • Updated Jun 17 • 324 • 35
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.2_self_70m Viewer • Updated Mar 14 • 37.9k • 43
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_1.0_seed_1 Viewer • Updated Mar 21 • 568k • 97
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 22 • 568k • 274
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 25 • 568k • 106
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.3_seed_2 Viewer • Updated Mar 25 • 189k • 60
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 88
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.25 Viewer • Updated Mar 26 • 568k • 67
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.9 Viewer • Updated Mar 26 • 568k • 62
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.9 Viewer • Updated Mar 26 • 568k • 78
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 27 • 568k • 218
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.3 Viewer • Updated Mar 27 • 568k • 156
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.5 Viewer • Updated Mar 27 • 568k • 83
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_1_t_1.0_eval Viewer • Updated Mar 30 • 568k • 174
Telugu-LLM-Labs/sindhi_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 38 • 2
Telugu-LLM-Labs/assamese_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 47 • 1
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_tiny__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 100 • 34
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.2_self_160m Viewer • Updated Mar 14 • 37.9k • 36
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-12_filter_gold_thr_0.1_self_160m Viewer • Updated Mar 21 • 37.9k • 40
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.3_seed_1 Viewer • Updated Mar 21 • 568k • 199
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.3_seed_3 Viewer • Updated Mar 21 • 568k • 177
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 21 • 568k • 120
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_1 Viewer • Updated Mar 23 • 568k • 214
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 22 • 568k • 114
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 23 • 568k • 129
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_3 Viewer • Updated Mar 23 • 568k • 52
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 23 • 568k • 92
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.1_seed_2 Viewer • Updated Mar 24 • 189k • 62
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 24 • 189k • 56
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 24 • 568k • 144
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.3_seed_2 Viewer • Updated Mar 25 • 189k • 41
Mitsuki-Sakamoto/alfa-deberta-re-pref-64-fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 26 • 94.6k • 38
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 27 • 568k • 82
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.9 Viewer • Updated Mar 27 • 568k • 165
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.9 Viewer • Updated Mar 27 • 568k • 99
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_3_t_1.0_eval Viewer • Updated Mar 30 • 568k • 117
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3 Viewer • Updated Apr 26 • 303k • 88
gogo8232/experiment_perplexity_instruction_llama3_8b_response Viewer • Updated Jul 5 • 34.9k • 36
oliverwang15/fingpt_chatglm2_sentiment_instruction_lora_ft_dataset Viewer • Updated Jul 11, 2023 • 67.2k • 40 • 9
lucasmccabe-lmi/sql-create-context_alpaca_style Viewer • Updated May 15, 2023 • 78.6k • 44 • 5
japneets/Alpaca_instruction_fine_tune_Punjabi_small Viewer • Updated Apr 16, 2023 • 10k • 41 • 1
filopedraz/swedish-sentiment-instruction-fine-tuning Viewer • Updated Jun 13, 2023 • 164k • 38 • 1
anton96vice/samantha-1.1-uncensored-split-and-prepared Viewer • Updated Mar 7 • 2.04k • 39 • 1
Telugu-LLM-Labs/konkani_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 62 • 1
Hadnet/olavo-article-17k-llama2-chat-dataset-text Viewer • Updated Sep 25, 2023 • 17.4k • 40 • 1
UMCU/WikiDocPatientInformation_Dutch_translated_with_MariaNMT Viewer • Updated Jan 22 • 5.76k • 46
Cesar7980/fingpt_chatglm2_sentiment_instruction_lora_ft_dataset Viewer • Updated Nov 8, 2023 • 76.8k • 40
rodrfons/fingpt_chatglm2_sentiment_instruction_lora_ft_dataset Viewer • Updated Nov 18, 2023 • 76.8k • 36
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1.0_OCR-C25-L25-E50-R10 Viewer • Updated Nov 29, 2023 • 230 • 31
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 10.1M • 133
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_tiny__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 87 • 34
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_large__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 100k • 38
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_medium__OCR-C25-L25-E50-R05 Viewer • Updated Nov 30, 2023 • 10k • 34
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_large__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 100k • 34
mfmezger/sandboxai_german_to_english_translations_seperated Viewer • Updated Feb 15 • 1.35M • 46
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.5_self_160m Viewer • Updated Mar 14 • 37.9k • 56
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-12_filter_gold_thr_0.3_self_160m Viewer • Updated Mar 21 • 37.9k • 34
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-12_filter_gold_thr_1.0_self_160m Viewer • Updated Mar 21 • 18.9k • 32
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.1_seed_1 Viewer • Updated Mar 21 • 568k • 95
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 21 • 568k • 201
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 23 • 568k • 439
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 21 • 568k • 244
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 21 • 568k • 96
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 24 • 568k • 237
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1 Viewer • Updated Mar 22 • 568k • 238
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_1 Viewer • Updated Mar 22 • 568k • 67
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_2 Viewer • Updated Mar 22 • 568k • 77
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 22 • 568k • 190
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_3 Viewer • Updated Mar 23 • 568k • 174
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_3 Viewer • Updated Mar 23 • 568k • 99
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_1.0_seed_2 Viewer • Updated Mar 24 • 189k • 59
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.1_seed_3 Viewer • Updated Mar 24 • 189k • 92
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_1.0_seed_3 Viewer • Updated Mar 24 • 189k • 37
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.3_seed_3 Viewer • Updated Mar 24 • 189k • 68
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.3_seed_1 Viewer • Updated Mar 25 • 189k • 64
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 25 • 189k • 46
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_1 Viewer • Updated Mar 25 • 40k • 36
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_14m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 84
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 79
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 69
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 128
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 26 • 568k • 149
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.75 Viewer • Updated Mar 26 • 568k • 110
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_1.0_eval Viewer • Updated Mar 28 • 568k • 269
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_1_t_1.0_eval Viewer • Updated Mar 29 • 568k • 105
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_2_t_1.0_eval Viewer • Updated Mar 29 • 568k • 265
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_2_t_1.0_eval Viewer • Updated Mar 30 • 568k • 282
thusinh1969/llama-2-7b-LongContext-mixed-64k-30APRIL2024 Viewer • Updated May 1 • 81.8k • 51 • 1
HachiML/oasst1_for_self-rewarding_EFT_Mixtral-8x22B-Instruct Viewer • Updated May 29 • 5.24k • 37
murugeshmarvel/a5d87d8c1326b4f0c531065dbe7f5068a2bab8a56edc9a9d4aab95be427bb171 Viewer • Updated Jun 5 • 95k • 32
generative-technologies/synth-ehr-icd10-llama3-format Viewer • Updated Jun 23 • 379k • 105 • 1
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_small__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 1.01k • 38
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_medium__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 10k • 38
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_full__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 1.42M • 48
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.1_self_70m Viewer • Updated Mar 14 • 37.9k • 39
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.1_self_160m Viewer • Updated Mar 14 • 37.9k • 36
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_1.0_seed_3 Viewer • Updated Mar 21 • 568k • 163
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_1 Viewer • Updated Mar 22 • 568k • 74
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_1 Viewer • Updated Mar 22 • 568k • 114
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_2 Viewer • Updated Mar 23 • 568k • 95
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_3 Viewer • Updated Mar 23 • 568k • 144
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 23 • 568k • 70
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 24 • 568k • 244
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_1.0_seed_1 Viewer • Updated Mar 24 • 189k • 47
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.1_seed_1 Viewer • Updated Mar 25 • 189k • 44
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_1.0_seed_1 Viewer • Updated Mar 25 • 189k • 41
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_1.0_seed_2 Viewer • Updated Mar 25 • 189k • 61
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_2 Viewer • Updated Mar 25 • 40k • 40
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_14m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 99
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_1.0 Viewer • Updated Apr 19 • 568k • 150
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 148
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 70
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 147
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.9 Viewer • Updated Mar 26 • 568k • 96
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.5 Viewer • Updated Mar 26 • 568k • 127
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.5 Viewer • Updated Mar 26 • 568k • 186
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 27 • 568k • 53
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.1 Viewer • Updated Mar 27 • 568k • 129
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.3 Viewer • Updated Mar 27 • 568k • 114
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.1 Viewer • Updated Mar 27 • 568k • 166
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.7 Viewer • Updated Mar 27 • 568k • 169
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.9 Viewer • Updated Mar 27 • 568k • 105
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_1.0_eval Viewer • Updated Mar 28 • 568k • 227
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_1_t_1.0_eval Viewer • Updated Mar 30 • 568k • 131
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_2_t_1.0_eval Viewer • Updated Mar 30 • 568k • 235
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_3_t_1.0_eval Viewer • Updated Mar 30 • 568k • 155
Mitsuki-Sakamoto/alpaca_farm-RM-Mistral-7B-re-preference-256-nsample-2 Viewer • Updated Apr 15 • 20k • 34
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1 Viewer • Updated Apr 26 • 303k • 118
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_5 Viewer • Updated Apr 26 • 303k • 121
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_medium__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 10k • 37
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_small__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 1k • 37
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.5_self_70m Viewer • Updated Mar 14 • 37.9k • 41
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 21 • 568k • 118
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 21 • 568k • 146
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 22 • 568k • 108
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2 Viewer • Updated Mar 22 • 568k • 95
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_2 Viewer • Updated Mar 22 • 568k • 142
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 24 • 568k • 140
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 25 • 189k • 58
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 25 • 189k • 39
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_1.0 Viewer • Updated Apr 19 • 568k • 222
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 157
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 77
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 124
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.75 Viewer • Updated Mar 26 • 568k • 128
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.3 Viewer • Updated Mar 27 • 568k • 148
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.5 Viewer • Updated Mar 27 • 568k • 127
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_1_t_1.0_eval Viewer • Updated Mar 30 • 568k • 291
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_2_t_1.0_eval Viewer • Updated Mar 30 • 568k • 111
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_3_t_1.0_eval Viewer • Updated Mar 30 • 568k • 447
y1xing/natural_language_prompt_synthetic_dataset_evaluation_instruct_dataset Viewer • Updated Jul 14 • 435 • 32
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 80
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_1_16 Viewer • Updated Mar 26 • 20k • 38
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.75 Viewer • Updated Mar 26 • 568k • 127
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.25 Viewer • Updated Mar 26 • 568k • 111
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.1 Viewer • Updated Mar 27 • 568k • 67
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.7 Viewer • Updated Mar 27 • 568k • 111
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_1.0_eval Viewer • Updated Mar 28 • 568k • 132
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2 Viewer • Updated Apr 26 • 303k • 63
y1xing/llama3_concatenated_data_with_chris_examples_orpo_instruct_dataset Viewer • Updated Jul 6 • 2.64k • 32
y1xing/llama_chris_examples_generated_synthetic_data_instruct_dataset Viewer • Updated Jul 13 • 1.85k • 32
y1xing/partially_correct_llama_all_synthetic_data_instruct_dataset Viewer • Updated Jul 14 • 1.53k • 32
y1xing/llama_all_synthetic_dataset_evaluation_instruct_dataset Viewer • Updated Jul 14 • 435 • 32
Mitsuki-Sakamoto/alpaca_farm-alpaca_gpt4_preference-re-preference_eval Viewer • Updated Jan 15 • 197k • 31
Mitsuki-Sakamoto/alpaca_farm-alpaca_instructions-re-preference Viewer • Updated Jan 17 • 22k • 107
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-eval-preference Viewer • Updated Feb 5 • 2k • 34
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-are-preference-256 Viewer • Updated Mar 1 • 22k • 35
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-test Viewer • Updated Apr 19 • 40 • 35
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-256-nsample-4 Updated Mar 6 • 33
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-256-nsample-8 Viewer • Updated Mar 6 • 20k • 33
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-256-nsample-16 Viewer • Updated Mar 7 • 20k • 33
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-16_random Viewer • Updated Mar 10 • 60k • 90
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.2_self_70m Viewer • Updated Mar 15 • 37.9k • 103
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.1_self_70m Viewer • Updated Mar 18 • 189k • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.5_self_70m Viewer • Updated Mar 18 • 189k • 34
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.1_self_160m Updated Mar 21 • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.5_self_160m Updated Mar 18 • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.2_self_160m Viewer • Updated Mar 15 • 37.9k • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.0_self_70m Viewer • Updated Mar 18 • 189k • 32
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.0_self_160m Viewer • Updated Mar 18 • 189k • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.5_self_70m Viewer • Updated Mar 19 • 189k • 77
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.1_self_70m Viewer • Updated Mar 19 • 189k • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.0_self_70m Viewer • Updated Mar 19 • 189k • 32
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.1_self_160m Updated Mar 19 • 32
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.5_self_160m Updated Mar 19 • 32
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.0_self_160m Updated Mar 19 • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.3_self_160m Updated Mar 21 • 36
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_1.0_self_160m Updated Mar 21 • 33
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_1.0 Viewer • Updated Apr 19 • 568k • 34
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_3_t_1.0_eval Viewer • Updated Mar 29 • 568k • 32
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_4 Viewer • Updated Apr 25 • 40k • 32
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_5 Viewer • Updated Apr 25 • 40k • 32
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-test-alpaca-gen Viewer • Updated May 12 • 20 • 34
Karmukilan/Malikeh1375_medical-question-answering-datasets Viewer • Updated Jul 16 • 1k • 44 • 2
y1xing/natural_language_prompt_w_correct_ans_dataset_evaluation_instruct_dataset Viewer • Updated Jul 26 • 276 • 34
y1xing/natural_language_prompt_w_correct_ans_synthetic_dataset_evaluation_instruct_dataset Viewer • Updated Jul 26 • 435 • 34
y1xing/natural_language_prompt_w_correct_ans_dataset_json_evaluation_instruct_dataset Viewer • Updated Jul 29 • 276 • 41
y1xing/natural_language_prompt_w_correct_ans_synthetic_dataset_evaluation_json_instruct_dataset Viewer • Updated Jul 29 • 435 • 33
y1xing/natural_language_prompt_w_correct_ans_dataset_training_instruct_dataset Viewer • Updated Jul 30 • 2.99k • 35
UMCU/MedicalFlashCards_Dutch_translated_with_MariaNMT Viewer • Updated Oct 31, 2023 • 32.9k • 34
Mitsuki-Sakamoto/sft_alpaca_pythia-1.4b-use_response_template-deberta-v3 Viewer • Updated Aug 1 • 20k • 33
Mitsuki-Sakamoto/sft_alpaca_pythia-160m-use_response_template-deberta-v3 Viewer • Updated Aug 1 • 20k • 33
purulalwani/Synthetic-Financial-Datasets-For-Fraud-Detection-Cleaned Viewer • Updated Aug 8 • 6.36M • 40
purulalwani/Synthetic-Financial-Datasets-For-Fraud-Detection-Cleaned-Split Viewer • Updated Aug 8 • 6.36M • 38
louisbrulenaudet/code-pensions-civiles-militaires-retraite Viewer • Updated about 16 hours ago • 257 • 91
louisbrulenaudet/code-disciplinaire-penal-marine-marchande Viewer • Updated about 16 hours ago • 6 • 108
louisbrulenaudet/code-domaine-public-fluvial-navigation-interieure Viewer • Updated about 16 hours ago • 2 • 111
louisbrulenaudet/code-action-sociale-familles Viewer • Updated about 16 hours ago • 3.65k • 101
louisbrulenaudet/code-domaine-etat-collectivites-mayotte Viewer • Updated about 16 hours ago • 3 • 77
louisbrulenaudet/code-legion-honneur-medaille-militaire-ordre-national-merite Viewer • Updated about 16 hours ago • 224 • 80
louisbrulenaudet/code-propriete-personnes-publiques Viewer • Updated about 16 hours ago • 1.13k • 110
louisbrulenaudet/code-postes-communications-electroniques Viewer • Updated about 16 hours ago • 730 • 111
louisbrulenaudet/code-instruments-monetaires-medailles Viewer • Updated about 16 hours ago • 6 • 101
arcee-globe/Evaluated_CohereForAI-aya_collection-aya_dataset Viewer • Updated Aug 20 • 14k • 38
Epic3123/election_misinformation_sleeper_agents_dataset_llama27b Viewer • Updated Aug 29 • 733 • 43
FoxySapiens/teknofest-egitim-hukuk-tarim-surdurulebilirlik-dataset Viewer • Updated Sep 7 • 233k • 39
DLI-Lab/Mind2Web-cleaned-lite-reward-model-w-cot-formatted Viewer • Updated Sep 15 • 6.13k • 32
DLI-Lab/Mind2Web-cleaned-lite-value-model-w-cot-formatted-test Viewer • Updated Sep 18 • 6.13k • 31
DLI-Lab/Mind2Web-cleaned-lite-reward-model-w-cot-formatted-v2 Viewer • Updated Sep 19 • 6.13k • 33
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_instruct Viewer • Updated Sep 27 • 100k • 32 • 1
tayyibsupercool/resource_allocation_telecom_energy_efficiency_instruct Viewer • Updated Sep 27 • 100k • 39 • 1
DLI-Lab/Mind2Web-cleaned-lite-acctree-value-model-w-cot-formatted Viewer • Updated Sep 26 • 6.13k • 32
JiaweiGuo123/Alpaca-gpt4-English-with-gsm8k-semantic-similarity Viewer • Updated Oct 2 • 52k • 31
aamina/channel_gains_vs_tx_powers_ee_augmented_with_context_10k Viewer • Updated Oct 4 • 10k • 31
Self-GRIT/open-hermes-2.5-sft-llama3-inference-query-reformulation-tokens Viewer • Updated Oct 4 • 33.3k • 32
aamina/channel_gains_vs_tx_powers_ee_augmented_with_30_examples_context_10k Viewer • Updated Oct 5 • 10k • 31
JiaweiGuo123/Alpaca-gpt4-English-with-humaneval-structure-similarity Viewer • Updated Oct 9 • 52k • 37
zyusc/Alpaca-gpt4-English-with-humaneval-structure-similarity Viewer • Updated Oct 10 • 52k • 33
tayyibsupercool/resource_allocation_telecom_energy_efficiency_3_users_instruct Viewer • Updated Oct 13 • 1.25k • 34
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_3_users_instruct Viewer • Updated Oct 13 • 1.25k • 46
tayyibsupercool/resource_allocation_telecom_energy_efficiency_2_users_rician_fading_instruct Viewer • Updated Oct 10 • 1k • 32
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_2_users_rician_fading_instruct Viewer • Updated Oct 10 • 1k • 30
JiaweiGuo123/Alpaca-gpt4-English-with-humaneval-structure-similarity-optimize Viewer • Updated Oct 10 • 802 • 33
JiaweiGuo123/Alpaca-gpt4-English-with-humaneval-code-sementic-similarity Viewer • Updated Oct 10 • 802 • 43
JiaweiGuo123/Alpaca-gpt4-English-with-humaneval-structure-similarity-without-comment Viewer • Updated Oct 11 • 802 • 35
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_5_instruct Viewer • Updated Oct 11 • 1.25k • 31
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_5_instruct Viewer • Updated Oct 11 • 1.25k • 31
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_4_instruct Viewer • Updated Oct 12 • 1.25k • 32
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_4_instruct Viewer • Updated Oct 12 • 1.25k • 33
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_8_instruct Viewer • Updated Oct 12 • 1.25k • 32
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_8_instruct Viewer • Updated Oct 12 • 1.25k • 33
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_12_instruct Viewer • Updated Oct 12 • 1.25k • 33
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_12_instruct Viewer • Updated Oct 12 • 1.25k • 32
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_2_instruct Viewer • Updated Oct 12 • 1.25k • 42
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_2_instruct Viewer • Updated Oct 12 • 1.25k • 34
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_6_instruct Viewer • Updated Oct 12 • 1.25k • 36
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_6_instruct Viewer • Updated Oct 12 • 1.25k • 32
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_10_instruct Viewer • Updated Oct 12 • 1.25k • 40
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_10_instruct Viewer • Updated Oct 12 • 1.25k • 41
aamina/channel_gains_vs_tx_powers_ee_augmented_with_300_examples_context Viewer • Updated Oct 13 • 10k • 36
tayyibsupercool/resource_allocation_telecom_energy_efficiency_area_500_instruct Viewer • Updated Oct 13 • 12.5k • 33
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_area_500_instruct Viewer • Updated Oct 13 • 12.5k • 34
tayyibsupercool/resource_allocation_telecom_energy_efficiency_30_area_instruct Viewer • Updated Oct 13 • 12.5k • 32
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_30_area_instruct Viewer • Updated Oct 13 • 12.5k • 33
aamina/channel_gains_vs_tx_powers_ee_augmented_with_100_examples_context Viewer • Updated Oct 13 • 10k • 38
tayyibsupercool/resource_allocation_telecom_energy_efficiency_area_150_instruct Viewer • Updated Oct 13 • 12.5k • 32
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_area_150_instruct Viewer • Updated Oct 13 • 12.5k • 32
tayyibsupercool/resource_allocation_telecom_energy_efficiency_area_250_instruct Viewer • Updated Oct 13 • 12.5k • 33
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_area_250_instruct Viewer • Updated Oct 13 • 12.5k • 32
tayyibsupercool/resource_allocation_telecom_energy_efficiency_area_350_instruct Viewer • Updated Oct 13 • 12.5k • 33
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_area_350_instruct Viewer • Updated Oct 13 • 12.5k • 32
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_2_instruct_10k Viewer • Updated 28 days ago • 12.5k • 38
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_2_instruct_10k Viewer • Updated 28 days ago • 12.5k • 38
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_4_instruct_10k Viewer • Updated 28 days ago • 12.5k • 35
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_4_instruct_10k Viewer • Updated 28 days ago • 12.5k • 39
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_6_instruct_10k Viewer • Updated 28 days ago • 12.5k • 35
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_6_instruct_10k Viewer • Updated 28 days ago • 12.5k • 35
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_8_instruct_10k Viewer • Updated 28 days ago • 12.5k • 40
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_8_instruct_10k Viewer • Updated 28 days ago • 12.5k • 39
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_10_instruct_10k Viewer • Updated 28 days ago • 12.5k • 39
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_10_instruct_10k Viewer • Updated 28 days ago • 12.5k • 35
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_12_instruct_10k Viewer • Updated 28 days ago • 12.5k • 43
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_12_instruct_10k Viewer • Updated 28 days ago • 12.5k • 39
aamina/channel_gains_vs_tx_powers_se_augmented_with_300_examples_context Viewer • Updated Oct 17 • 10k • 37
aamina/channel_gains_vs_tx_powers_se_augmented_with_30_examples_context_10k Viewer • Updated Oct 19 • 10k • 52
sert121/adult_dataset_with_instructions_balanced Viewer • Updated about 1 month ago • 15.7k • 132
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_2_instruct_1k Viewer • Updated about 1 month ago • 1.25k • 39
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_2_instruct_1k Viewer • Updated about 1 month ago • 1.25k • 41
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_4_instruct_1k Viewer • Updated about 1 month ago • 1.25k • 42
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_4_instruct_1k Viewer • Updated about 1 month ago • 1.25k • 38
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_6_instruct_1k Viewer • Updated about 1 month ago • 1.25k • 40
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_6_instruct_1k Viewer • Updated about 1 month ago • 1.25k • 39
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_8_instruct_1k Viewer • Updated about 1 month ago • 1.25k • 40
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_8_instruct_1k Viewer • Updated about 1 month ago • 1.25k • 39
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_10_instruct_1k Viewer • Updated about 1 month ago • 1.25k • 40
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_10_instruct_1k Viewer • Updated about 1 month ago • 1.25k • 38
tayyibsupercool/resource_allocation_telecom_energy_efficiency_rician_k_12_instruct_1k Viewer • Updated about 1 month ago • 1.25k • 39
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_rician_k_12_instruct_1k Viewer • Updated about 1 month ago • 1.25k • 40
MakiAi/OKU_wiki_llama3.1_8b_inst_Reflexive_chunk200_overlap700 Viewer • Updated 21 days ago • 703 • 33
antash420/long-context-text-summarization-alpaca-format Viewer • Updated 19 days ago • 216k • 115
Gramacho/complete_pira_train_val_corpus1_ptbr_llama3_alpaca_1484 Viewer • Updated 16 days ago • 1.48k • 75
namejun12000/AW_finetuning_5core_try1_all_final_valid Viewer • Updated 14 days ago • 22.4k • 40
namejun12000/AW_finetuning_5core_split1_all_final_valid Viewer • Updated 14 days ago • 22.4k • 63
Gramacho/complete_pira_test_corpus1_ptbr_llama3_alpaca_181 Viewer • Updated 16 days ago • 181 • 38
namejun12000/AW_finetuning_5core_try1_all_final_valid_include Viewer • Updated 2 days ago • 22.4k • 43
namejun12000/AW_finetuning_5core_split1_all_final_valid_include Viewer • Updated 2 days ago • 22.4k • 113
namejun12000/AW_finetuning_5core_try1_all_final_final Viewer • Updated 15 days ago • 22.4k • 28
namejun12000/AW_finetuning_5core_split1_all_final_final Viewer • Updated 15 days ago • 22.4k • 29
namejun12000/AW_finetuning_5core_split1_all_final_valid_include_50 Viewer • Updated 2 days ago • 22.4k • 93
namejun12000/AW_finetuning_5core_split1_all_final_valid_include_10 Viewer • Updated 2 days ago • 22.4k • 66
mlfoundations-dev/unnatural_instructions_gpt-4o-mini_test Viewer • Updated 8 days ago • 100 • 43
zsj999999999/llama3_medical_meadow_wikidoc_instruct_dataset Viewer • Updated 9 days ago • 10k • 21
namejun12000/AW_finetuning_5core_try1_all_final_valid_include_inference1 Viewer • Updated 2 days ago • 22.4k • 8
namejun12000/AW_finetuning_5core_try1_all_final_valid_include_inference2 Viewer • Updated 2 days ago • 22.4k • 8
namejun12000/AW_finetuning_5core_split1_all_final_valid_include_inference1 Viewer • Updated 2 days ago • 22.4k • 13
namejun12000/AW_finetuning_5core_split1_all_final_valid_include_inference2 Viewer • Updated 2 days ago • 22.4k • 9