Top 10% instruction tuning datasets
Collects datasets with 'instruction' in the name and more than 1 download and in the top 10% for the number of likes
Viewer • Updated • 7.15M • 1.97k • 51Note Dataset has the following tags: ['task_categories:other', 'annotations_creators:crowdsourced', 'annotations_creators:expert-generated', 'multilinguality:monolingual', 'size_categories:100M
qwedsacf/grade-school-math-instructions
Viewer • Updated • 8.79k • 190 • 45Note Dataset has the following tags: ['region:us']
HuggingFaceH4/instruction-dataset
Viewer • Updated • 327 • 577 • 47Note Dataset has the following tags: ['license:apache-2.0', 'region:us']
alespalla/chatbot_instruction_prompts
Viewer • Updated • 323k • 237 • 45Note Dataset has the following tags: ['task_categories:question-answering', 'task_categories:conversational', 'task_categories:text-generation', 'size_categories:100K
ArmelR/stack-exchange-instruction
Viewer • Updated • 12.2M • 814 • 66Note Dataset has the following tags: ['region:us']
MBZUAI/LaMini-instruction
Viewer • Updated • 2.59M • 3.64k • 127Note Dataset has the following tags: ['task_categories:text2text-generation', 'size_categories:1M
llm-wizard/dolly-15k-instruction-alpaca-format
Viewer • Updated • 15k • 268 • 29Note Dataset has the following tags: ['size_categories:10K
openllmplayground/pandagpt_visual_instruction_dataset
Preview • Updated • 138 • 13Note Dataset has the following tags: ['license:cc-by-nc-sa-4.0']
rewoo/planner_instruction_tuning_2k
Viewer • Updated • 2.04k • 64 • 31Note Dataset has the following tags: ['license:mit', 'region:us']
LinkSoul/instruction_merge_set
Viewer • Updated • 10.1M • 265 • 119Note Dataset has the following tags: ['region:us']
zjunlp/Mol-Instructions
Updated • 769 • 44Note Dataset has the following tags: ['size_categories:100M
TokenBender/code_instructions_122k_alpaca_style
Viewer • Updated • 122k • 470 • 69Note Dataset has the following tags: ['license:apache-2.0', 'region:us']
codefuse-ai/Evol-instruction-66k
Updated • 209 • 72Note Dataset has the following tags: ['license:cc-by-nc-sa-4.0', 'region:us']