K - a diwank Collection

chat-ui

jondurbin/py-dpo-v0.1

Viewer • Updated Jan 11 • 9.47k • 170 • 44

jondurbin/gutenberg-dpo-v0.1

Viewer • Updated Jan 12 • 918 • 1.62k • 124

jondurbin/cinematika-v0.1

Viewer • Updated Apr 11 • 47.1k • 393 • 53

ParisNeo/lollms_aware_dataset

Viewer • Updated Oct 27, 2023 • 464 • 110 • 5

grimulkan/LimaRP-augmented

Viewer • Updated Jan 24 • 804 • 67 • 29

TIGER-Lab/MathInstruct

Viewer • Updated May 15 • 262k • 1.8k • 247

christopher/rosetta-code

Viewer • Updated Sep 24, 2023 • 79k • 216 • 31

b-mc2/sql-create-context

Viewer • Updated Jan 25 • 78.6k • 2.83k • 407

migtissera/Synthia-v1.3

Viewer • Updated Nov 16, 2023 • 119k • 125 • 99

tinyBenchmarks/tinyMMLU

Viewer • Updated Jul 8 • 385 • 3.43k • 16

tinyBenchmarks/tinyWinogrande

Preview • Updated May 25 • 2.31k • 3

tinyBenchmarks/tinyAI2_arc

Preview • Updated May 25 • 2.17k • 3

tinyBenchmarks/tinyHellaswag

Viewer • Updated May 25 • 50k • 2.36k • 4

tinyBenchmarks/tinyTruthfulQA

Preview • Updated May 25 • 1.89k • 3

tinyBenchmarks/tinyAlpacaEval

Viewer • Updated Apr 19 • 100 • 101 • 4

tinyBenchmarks/tinyGSM8k

Preview • Updated May 25 • 2.47k • 5

cognitivecomputations/samantha-data

Updated Mar 29 • 880 • 123

roborovski/synthetic-tool-calls

Viewer • Updated Mar 5 • 6.01k • 41 • 1

roborovski/glaive-tool-usage-dpo

Viewer • Updated Feb 29 • 42k • 47 • 2

kalomaze/StackMix-v0.1

Viewer • Updated Feb 28 • 30 • 44 • 2

roborovski/glaive-function-calling-v2-conversation

Viewer • Updated Feb 19 • 113k • 37 • 2

mlabonne/truthy-dpo-v0.1

Viewer • Updated Feb 18 • 1.02k • 45 • 1

ai4bharat/indic-align

Viewer • Updated Jul 25 • 97.4M • 750 • 10

coseal/CodeUltraFeedback_binarized

Viewer • Updated Mar 18 • 9.5k • 61 • 15

coseal/CodeUltraFeedback

Viewer • Updated Mar 15 • 10k • 103 • 25

KTO: Model Alignment as Prospect Theoretic Optimization

Paper • 2402.01306 • Published Feb 2 • 15

ai4bharat/sangraha

Viewer • Updated 26 days ago • 268M • 12.7k • 30

Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

Paper • 2311.04205 • Published Nov 7, 2023 • 5

Multilingual Instruction Tuning With Just a Pinch of Multilinguality

Paper • 2401.01854 • Published Jan 3 • 10

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2 • 64

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 183

Self-Instruct: Aligning Language Model with Self Generated Instructions

Paper • 2212.10560 • Published Dec 20, 2022 • 8

HuggingFaceH4/self-instruct-seed

Viewer • Updated Jan 31, 2023 • 175 • 360 • 25

ToolTalk: Evaluating Tool-Usage in a Conversational Setting

Paper • 2311.10775 • Published Nov 15, 2023 • 7

Dynamic Planning with a LLM

Paper • 2308.06391 • Published Aug 11, 2023 • 2

FreedomIntelligence/SocraticChat

Viewer • Updated Oct 12, 2023 • 50.7k • 45 • 6

Large Language Model as a User Simulator

Paper • 2308.11534 • Published Aug 21, 2023 • 2

Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning

Paper • 2309.10814 • Published Sep 19, 2023 • 3

AlpaGasus: Training A Better Alpaca with Fewer Data

Paper • 2307.08701 • Published Jul 17, 2023 • 22

mlabonne/alpagasus

Viewer • Updated Aug 3, 2023 • 9.23k • 53 • 8

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Paper • 2310.12823 • Published Oct 19, 2023 • 35

THUDM/AgentInstruct

Viewer • Updated Oct 23, 2023 • 1.87k • 296 • 198

Diversity of Thought Improves Reasoning Abilities of Large Language Models

Paper • 2310.07088 • Published Oct 11, 2023 • 5

SmartPlay : A Benchmark for LLMs as Intelligent Agents

Paper • 2310.01557 • Published Oct 2, 2023 • 12

Large Language Models Cannot Self-Correct Reasoning Yet

Paper • 2310.01798 • Published Oct 3, 2023 • 33

MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback

Paper • 2309.10691 • Published Sep 19, 2023 • 4

LLM+P: Empowering Large Language Models with Optimal Planning Proficiency

Paper • 2304.11477 • Published Apr 22, 2023 • 3

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14 • 72

SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning

Paper • 2308.00436 • Published Aug 1, 2023 • 21

Running

439

📢

UGI Leaderboard

MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning

Paper • 2310.16049 • Published Oct 24, 2023 • 4

Instruction-Following Evaluation for Large Language Models

Paper • 2311.07911 • Published Nov 14, 2023 • 19

allenai/UNcommonsense

Viewer • Updated Jan 19 • 18.3k • 89 • 8

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations

Paper • 2311.08469 • Published Nov 14, 2023 • 10

Flows: Building Blocks of Reasoning and Collaborating AI

Paper • 2308.01285 • Published Aug 2, 2023 • 2

aiflows/CCFlows

Updated Dec 10, 2023 • 2

Learning to Reason and Memorize with Self-Notes

Paper • 2305.00833 • Published May 1, 2023 • 4

Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework

Paper • 2305.03268 • Published May 5, 2023 • 2

Making Large Language Models Better Reasoners with Alignment

Paper • 2309.02144 • Published Sep 5, 2023 • 2

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency

Paper • 2309.17382 • Published Sep 29, 2023 • 4

ALERT: Adapting Language Models to Reasoning Tasks

Paper • 2212.08286 • Published Dec 16, 2022 • 2

CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay

Paper • 2402.04858 • Published Feb 7 • 14

Vivacem/MMIQC

Viewer • Updated Jan 20 • 2.29M • 83 • 14

LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error

Paper • 2403.04746 • Published Mar 7 • 22

Learning to Decode Collaboratively with Multiple Language Models

Paper • 2403.03870 • Published Mar 6 • 18

Large Language Models as Zero-shot Dialogue State Tracker through Function Calling

Paper • 2402.10466 • Published Feb 16 • 16

SynthDST: Synthetic Data is All You Need for Few-Shot Dialog State Tracking

Paper • 2402.02285 • Published Feb 3 • 1

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27 • 23

Towards Optimal Learning of Language Models

Paper • 2402.17759 • Published Feb 27 • 16

Evaluating Very Long-Term Conversational Memory of LLM Agents

Paper • 2402.17753 • Published Feb 27 • 18

Aman279/Locomo

Viewer • Updated Mar 7 • 35 • 8 • 1

Generative Representational Instruction Tuning

Paper • 2402.09906 • Published Feb 15 • 51

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20 • 46

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Paper • 2402.14658 • Published Feb 22 • 82

Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping

Paper • 2402.14083 • Published Feb 21 • 47

PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering

Paper • 2402.16288 • Published Feb 26 • 1

pandalla/Machine_Mindset_MBTI_dataset

Viewer • Updated Jun 4 • 161k • 520 • 51

berkeley-nest/Nectar

Viewer • Updated Mar 20 • 183k • 582 • 277

totally-not-an-llm/sharegpt-hyperfiltered-3k

Viewer • Updated Jul 13, 2023 • 3.24k • 86 • 14

HuggingFaceTB/cosmopedia

Viewer • Updated Aug 12 • 31.1M • 12k • 562

argilla/ultrafeedback-binarized-preferences-cleaned

Viewer • Updated Dec 11, 2023 • 60.9k • 7.88k • 124

dmayhem93/self-critiquing-refine

Viewer • Updated Apr 8, 2023 • 39.2k • 35 • 1

dmayhem93/self-critiquing-critique-and-refine

Viewer • Updated Apr 8, 2023 • 39.2k • 38 • 1

morzecrew/RefinedPersonaChat

Viewer • Updated Aug 7, 2023 • 207k • 62 • 2

beratcmn/rephrased-instruction-turkish-poems

Viewer • Updated Dec 16, 2023 • 4.96k • 39 • 4

Birchlabs/openai-prm800k-stepwise-critic

Viewer • Updated Jun 3, 2023 • 1.09M • 281 • 43

theblackcat102/evol-codealpaca-v1

Viewer • Updated Mar 10 • 111k • 1k • 150

meta-math/GSM8K_Backward

Viewer • Updated Nov 10, 2023 • 1.27k • 53 • 15

meta-math/MetaMathQA-40K

Viewer • Updated Nov 10, 2023 • 40k • 195 • 20

glaiveai/glaive-code-assistant-v2

Viewer • Updated Apr 4 • 215k • 76 • 43

Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study

Paper • 2403.03186 • Published Mar 5 • 5

PROC2PDDL: Open-Domain Planning Representations from Texts

Paper • 2403.00092 • Published Feb 29 • 1

btan2/cappy-large

Text Classification • Updated Dec 7, 2023 • 42 • 19

VMware/open-instruct

Viewer • Updated Jul 12, 2023 • 143k • 137 • 44

QizhiPei/BioT5_finetune_dataset

Viewer • Updated Sep 2 • 33 • 363 • 5

Tensoic/gooftagoo

Viewer • Updated Mar 16 • 16.2k • 45 • 9

GenVRadmin/Aryabhatta-Orca-Maths-Hindi

Viewer • Updated Mar 18 • 200k • 41 • 3

Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration

Paper • 2310.00280 • Published Sep 30, 2023 • 3

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models

Paper • 2311.05997 • Published Nov 10, 2023 • 36

wangwilliamyang/wikihow

Updated Jan 18 • 9

argilla/distilabel-capybara-kto-15k-binarized

Viewer • Updated Mar 19 • 15.1k • 67 • 4

argilla/ultrafeedback-binarized-preferences-cleaned-kto

Viewer • Updated Mar 19 • 231k • 130 • 8

argilla/distilabel-intel-orca-kto

Viewer • Updated Mar 19 • 23.1k • 40 • 5

argilla/kto-mix-15k

Viewer • Updated Apr 19 • 15.3k • 73 • 13

KnutJaegersberg/dolphin_orca_clustered

Updated Sep 14, 2023 • 36 • 1

GAIR/autoj-scenario-classifier

Text Generation • Updated Oct 9, 2023 • 20 • 5

Orca 2: Teaching Small Language Models How to Reason

Paper • 2311.11045 • Published Nov 18, 2023 • 70

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 182

Ask Optimal Questions: Aligning Large Language Models with Retriever's Preference in Conversational Search

Paper • 2402.11827 • Published Feb 19 • 1

Grounding Language Model with Chunking-Free In-Context Retrieval

Paper • 2402.09760 • Published Feb 15

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

Paper • 2403.12881 • Published Mar 19 • 16

BAAI/OPI

Preview • Updated 10 days ago • 473 • 8

internlm/Agent-FLAN

Preview • Updated Mar 20 • 103 • 65

kaist-ai/selfee-train

Viewer • Updated May 31, 2023 • 178k • 56 • 9

fabiochiu/medium-articles

Preview • Updated Jul 17, 2022 • 130 • 23

Reverse Training to Nurse the Reversal Curse

Paper • 2403.13799 • Published Mar 20 • 13

voidful/MuSiQue

Preview • Updated May 20, 2023 • 42 • 4

BAAI/bge-reranker-v2-m3

Text Classification • Updated Jun 24 • 600k • 381

allenai/reward-bench

Viewer • Updated Sep 9 • 8.11k • 7.32k • 75

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Paper • 2309.08968 • Published Sep 16, 2023 • 22

In-Context Learning Creates Task Vectors

Paper • 2310.15916 • Published Oct 24, 2023 • 41

Are Emergent Abilities in Large Language Models just In-Context Learning?

Paper • 2309.01809 • Published Sep 4, 2023 • 3

ZenMoore/RoleBench

Preview • Updated Nov 23, 2023 • 368 • 75

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 65

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12 • 62

princeton-nlp/QuRatedPajama-260B

Viewer • Updated Apr 16 • 254M • 550 • 6

Arcee's MergeKit: A Toolkit for Merging Large Language Models

Paper • 2403.13257 • Published Mar 20 • 20

Can large language models explore in-context?

Paper • 2403.15371 • Published Mar 22 • 32

Locutusque/OpenCerebrum-dpo

Viewer • Updated Mar 26 • 21.1k • 44 • 6

Doctor-Shotgun/theory-of-mind-dpo

Viewer • Updated Mar 14 • 539 • 63 • 16

Locutusque/arc-cot-dpo

Viewer • Updated Mar 26 • 957 • 37 • 5

fblgit/simple-math-DPO

Viewer • Updated Aug 1 • 800k • 169 • 16

KrisPi/PythonTutor-Evol-1k-DPO-GPT4_vs_35

Viewer • Updated Nov 18, 2023 • 943 • 37 • 13

zerolink/zsql-postgres-dpo

Viewer • Updated Feb 2 • 259k • 65 • 6

Lakera/gandalf_ignore_instructions

Viewer • Updated Oct 2, 2023 • 1k • 282 • 26

mrm8488/unnatural-instructions-full

Viewer • Updated Dec 21, 2022 • 66k • 91 • 16

NilanE/SmallParallelDocs-Ja_En-6k

Viewer • Updated Mar 5 • 6.32k • 149 • 2

Long-form factuality in large language models

Paper • 2403.18802 • Published Mar 27 • 24

NousResearch/OLMo-Bitnet-1B

Text Generation • Updated Apr 11 • 9.46k • 118

pyp1/VoiceCraft

Text-to-Speech • Updated Aug 21 • 41 • 205

CarperAI/openai_summarize_comparisons

Viewer • Updated Feb 27, 2023 • 260k • 1.68k • 39

PygmalionAI/PIPPA

Updated Sep 7, 2023 • 193 • 202

ivanleomk/gpt4-chain-of-density

Preview • Updated Nov 12, 2023 • 70 • 6

AIRI-NLP/cnli_memory_extracted

Viewer • Updated Mar 22 • 8.23k • 54 • 1

Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs

Paper • 2311.05657 • Published Nov 9, 2023 • 27

openbmb/UltraInteract_sft

Viewer • Updated Apr 5 • 289k • 1.26k • 118

openbmb/UltraInteract_pair

Viewer • Updated Apr 5 • 220k • 658 • 104

openbmb/Eurus-70b-nca

Text Generation • Updated Apr 12 • 234 • 11

Noise Contrastive Alignment of Language Models with Explicit Rewards

Paper • 2402.05369 • Published Feb 8 • 1

ai2lumos/lumos_multimodal_ground_iterative

Viewer • Updated Mar 19 • 15.9k • 39 • 1

ai2lumos/lumos_multimodal_plan_iterative

Viewer • Updated Mar 19 • 15.9k • 51 • 2

ai2lumos/lumos_complex_qa_plan_onetime

Viewer • Updated Mar 19 • 19.4k • 60 • 3

ai2lumos/lumos_complex_qa_ground_onetime

Viewer • Updated Mar 19 • 19.2k • 66 • 3

ai2lumos/lumos_complex_qa_ground_iterative

Viewer • Updated Mar 19 • 19.1k • 99 • 2

ai2lumos/lumos_unified_plan_iterative

Viewer • Updated Mar 19 • 55.4k • 57 • 2

ai2lumos/lumos_complex_qa_plan_iterative

Viewer • Updated Mar 18 • 19k • 113 • 6

ai2lumos/lumos_unified_ground_iterative

Viewer • Updated Mar 19 • 55.5k • 56 • 2

ai2lumos/lumos_web_agent_ground_iterative

Viewer • Updated Mar 18 • 1.01k • 46 • 2

ai2lumos/lumos_web_agent_plan_iterative

Viewer • Updated Mar 18 • 1.01k • 48 • 4

ai2lumos/lumos_maths_ground_iterative

Viewer • Updated Mar 18 • 19.5k • 56 • 3

ai2lumos/lumos_maths_ground_onetime

Viewer • Updated Mar 18 • 19.8k • 41 • 1

ai2lumos/lumos_maths_plan_onetime

Viewer • Updated Mar 18 • 19.8k • 49 • 2

Symbol-LLM/Symbol-LLM-7B-Instruct

Text Generation • Updated Jun 23 • 55 • 13

MoritzLaurer/deberta-v3-large-zeroshot-v2.0

Zero-Shot Classification • Updated Apr 11 • 90.9k • 82

MoritzLaurer/bge-m3-zeroshot-v2.0

Zero-Shot Classification • Updated Apr 22 • 24.4k • 41

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 17

iamrishiraj/com_qa

Viewer • Updated Feb 7 • 7.18k • 86 • 3

Pavithree/eli5

Viewer • Updated Apr 23, 2022 • 229k • 368 • 2

vicgalle/configurable-system-prompt-multitask

Viewer • Updated Apr 23 • 1.95k • 153 • 19

paraloq/json_data_extraction

Viewer • Updated Mar 25 • 484 • 71 • 17

livecodebench/execution

Viewer • Updated Mar 12 • 479 • 59 • 4

iamtarun/python_code_instructions_18k_alpaca

Viewer • Updated Jul 27, 2023 • 18.6k • 1.92k • 230

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Paper • 2403.15042 • Published Mar 22 • 25

manishiitg/CogStack-QA

Viewer • Updated Feb 9 • 24.7k • 38 • 1

manishiitg/CogStack-Tasks

Viewer • Updated Feb 9 • 4.69k • 33 • 1

manishiitg/CogStack-Conv

Viewer • Updated Feb 9 • 2.35k • 40 • 1

Reformatted Alignment

Paper • 2402.12219 • Published Feb 19 • 15

abacusai/SystemChat-1.1

Viewer • Updated Apr 11 • 20.2k • 96 • 30

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10 • 103

Anthropic/persuasion

Viewer • Updated Apr 9 • 3.94k • 399 • 175

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 84

M4-ai/prm_dpo_pairs

Viewer • Updated Jul 1 • 93.9k • 62 • 7

OpenLLM-France/Claire-Dialogue-French-0.1

Viewer • Updated Dec 5, 2023 • 37k • 448 • 40

amaydle/npc-dialogue

Viewer • Updated Mar 25, 2023 • 1.92k • 75 • 15

facebook/empathetic_dialogues

Updated Jan 18 • 1.1k • 92

Salesforce/dialogstudio

Updated Jul 21 • 944 • 214

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4 • 60

microsoft/Taskbench

Viewer • Updated Aug 21 • 17.3k • 603 • 20

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15 • 82

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Paper • 2404.03820 • Published Apr 4 • 24

mlabonne/orpo-dpo-mix-40k

Viewer • Updated 30 days ago • 44.2k • 1.52k • 245

allenai/persona-bias

Updated Feb 5 • 2.07k • 11

PleIAs/YouTube-Commons

Updated Jun 26 • 789 • 318

FreedomIntelligence/evol-instruct-hindi

Viewer • Updated Aug 6, 2023 • 59k • 13 • 2

FreedomIntelligence/OVM-process

Viewer • Updated Apr 1 • 7.47k • 39 • 1

nuprl/CanItEdit

Viewer • Updated Mar 19 • 105 • 246 • 11

totally-not-an-llm/EverythingLM-data-V3

Viewer • Updated Sep 11, 2023 • 1.07k • 73 • 31

RUCAIBox/Story-Generation

Updated Mar 3, 2023 • 58 • 11

fabraz/writingPromptAug

Viewer • Updated Oct 14, 2023 • 24.1k • 114 • 2

jerryjalapeno/nart-100k-synthetic

Viewer • Updated Jul 16, 2023 • 99.1k • 90 • 38

jat-project/jat-dataset

Viewer • Updated Feb 16 • 258M • 241k • 33

euclaise/ReMask-3B

Text Generation • Updated Aug 10 • 77 • 15

google/Synthetic-Persona-Chat

Viewer • Updated Mar 1 • 10.9k • 1.19k • 76

google/cvss

Updated Feb 10 • 191 • 12

neural-bridge/rag-dataset-12000

Viewer • Updated Feb 5 • 12k • 1.66k • 109

HannahRoseKirk/prism-alignment

Viewer • Updated Apr 25 • 77.9k • 994 • 60

Gigax/NPC-LLM-3_8B

Text Generation • Updated May 14 • 445 • 24

nuprl/MultiPL-T

Viewer • Updated Aug 20 • 215k • 287 • 7

cognitivecomputations/SystemChat-1.2

Viewer • Updated Apr 30 • 52 • 53 • 6

mlabonne/arena-preferences

Viewer • Updated Apr 27 • 2.69k • 70 • 9

INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning

Paper • 2401.06532 • Published Jan 12 • 10

Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization

Paper • 2401.07793 • Published Jan 15 • 3

yutaozhu94/INTERS

Preview • Updated Feb 19 • 915 • 12

THUDM/CogAgent

Updated Dec 18, 2023 • 16

urchade/gliner_large-v2.1

Token Classification • Updated Apr 10 • 11.4k • 27

shachardon/ShareLM

Viewer • Updated Aug 6 • 331k • 250 • 28

nvidia/ChatQA-Training-Data

Viewer • Updated Jun 4 • 442k • 1.07k • 160

lightblue/tagengo-gpt4

Viewer • Updated Jun 2 • 78.1k • 130 • 61

Efficient-Large-Model/Llama-3-VILA1.5-8B

Text Generation • Updated Aug 16 • 38.4k • 29

bigcode/commitpackft

Viewer • Updated Aug 20, 2023 • 702k • 5.75k • 61

glaiveai/glaive-code-assistant-v3

Viewer • Updated May 20 • 950k • 461 • 44

davanstrien/cosmochat

Viewer • Updated May 10 • 199 • 70 • 11

davanstrien/cosmopedia_chat

Viewer • Updated Mar 8 • 1.19k • 74 • 7

MemGPT/MSC-Self-Instruct

Viewer • Updated Nov 2, 2023 • 500 • 244 • 11

MemGPT/qa_data

Viewer • Updated Feb 6 • 18.6k • 17 • 1

google/imageinwords

Updated May 25 • 221 • 115

grammarly/coedit

Viewer • Updated Oct 21, 2023 • 70.8k • 849 • 63

bea2019st/wi_locness

Updated Jan 18 • 197 • 14

GEM/FairytaleQA

Viewer • Updated Oct 25, 2022 • 10.6k • 84 • 8

grammarly/medit

Viewer • Updated Oct 1 • 113k • 88 • 13

MemGPT/MemGPT-DPO-Dataset

Viewer • Updated Apr 18 • 42.3k • 52 • 8

lmsys/lmsys-arena-human-preference-55k

Viewer • Updated May 17 • 57.5k • 1.12k • 135

princeton-nlp/QuRating-GPT3.5-Judgments

Viewer • Updated Mar 29 • 250k • 42 • 5

princeton-nlp/AutoCompressor-Llama-2-7b-6k

Updated Nov 22, 2023 • 2.33k • 2

H-D-T/Select-Stack

Viewer • Updated Sep 2 • 1.46M • 42 • 16

EleutherAI/lichess-puzzles

Viewer • Updated May 9 • 1.48M • 55 • 20

selfrag/selfrag_train_data

Viewer • Updated Oct 31, 2023 • 146k • 147 • 66

community-datasets/yahoo_answers_topics

Viewer • Updated Jun 24 • 1.46M • 1.26k • 54

TIGER-Lab/MMLU-Pro

Viewer • Updated 29 days ago • 12.1k • 28.9k • 283

ylacombe/expresso

Viewer • Updated Apr 30 • 11.6k • 251 • 32

microsoft/MeetingBank-QA-Summary

Viewer • Updated May 16 • 862 • 169 • 11

microsoft/MeetingBank-LLMCompressed

Viewer • Updated May 16 • 5.17k • 94 • 14

nvidia/ChatRAG-Bench

Viewer • Updated May 24 • 34.6k • 1.59k • 100

xingyaoww/code-act

Viewer • Updated Feb 5 • 78.4k • 273 • 49

kaist-ai/Multifaceted-Collection-ORPO

Viewer • Updated Jul 1 • 64.6k • 54 • 9

Alibaba-NLP/gte-Qwen2-7B-instruct

hwjiang/Real3D

Image-to-3D • Updated Jun 14 • 11 • 12

nvidia/Aegis-AI-Content-Safety-Dataset-1.0

Viewer • Updated Jun 28 • 12k • 994 • 44

ProGamerGov/synthetic-dataset-1m-dalle3-high-quality-captions

Updated 17 days ago • 2.7k • 119

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Paper • 2406.08418 • Published Jun 12 • 28

facebook/multi-token-prediction

Updated Jun 18 • 350

TIGER-Lab/M-BEIR

Viewer • Updated Aug 7 • 2.86M • 854 • 11

tomg-group-umd/pixelprose

Viewer • Updated Jun 23 • 15.6M • 1.08k • 125

mit-han-lab/ShareGPT4V

Preview • Updated Feb 22 • 26 • 3

mit-han-lab/litepose

Updated Jun 5 • 1

mit-han-lab/Llama-3-8B-Instruct-QServe-g128

Text Generation • Updated May 6 • 10 • 1

internlm/internlm-xcomposer2-vl-7b

Visual Question Answering • Updated Apr 12 • 3.91k • 79

OpenGVLab/InternViT-6B-448px-V1-5

Image Feature Extraction • Updated Aug 23 • 10k • 74

OpenGVLab/InternVL-Chat-V1-5

Image-Text-to-Text • Updated Sep 24 • 3.87k • 401

OpenGVLab/Mini-InternVL-Chat-4B-V1-5

Image-Text-to-Text • Updated Sep 24 • 1.39k • 58

openbmb/MiniCPM-Llama3-V-2_5

Image-Text-to-Text • Updated Sep 25 • 44.8k • 1.37k

microsoft/Florence-2-large

Image-Text-to-Text • Updated 1 day ago • 841k • 1.23k

llava-hf/LLaVA-NeXT-Video-7B-DPO-hf

Video-Text-to-Text • Updated Aug 16 • 2.85k • 8

arcee-ai/BAAI-Infinity-Instruct-System

Viewer • Updated Jun 24 • 2.36M • 185 • 15

hpcai-tech/OpenSora-VAE-v1.2

Updated Jun 17 • 457k • 53

hpcai-tech/OpenSora-STDiT-v3

Updated Jun 17 • 226k • 41

liuqi6777/RankGPT-msmarco-100k-clean

Viewer • Updated Feb 6 • 87.3k • 48 • 1

failspy/Meta-Llama-3-70B-Instruct-abliterated-v3.5

Text Generation • Updated May 30 • 3.79k • 37

ResplendentAI/NSFW_RP_Format_DPO

Viewer • Updated Mar 17 • 400 • 53 • 59

microsoft/msr_text_compression

Updated Jan 18 • 76 • 8

microsoft/msr_sqa

Updated Jan 18 • 90 • 4

microsoft/crd3

Updated Jan 18 • 178 • 22

nvidia/domain-classifier

Updated Jun 24 • 58.4k • 54

jhu-clsp/FollowIR-train

Viewer • Updated Mar 25 • 1.78k • 47 • 5

vicgalle/Phudge-3

Text Classification • Updated May 30 • 8 • 3

TWO/sutra-mlt256-v2

Updated May 24 • 8

AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation

Paper • 2406.19251 • Published Jun 27 • 8

aiana94/xMINDlarge

Viewer • Updated 22 days ago • 4.12M • 84 • 4

OpenCo7/UpVoteWeb

Viewer • Updated Jul 17 • 557M • 493 • 92

davanstrien/magpie-preference

Viewer • Updated 3 days ago • 489 • 863 • 11

FunAudioLLM/SenseVoiceSmall

Updated Jul 31 • 2.82k • 170

euclaise/gsm8k_multiturn

Viewer • Updated Jul 6 • 8.79k • 57 • 13

internlm/internlm-xcomposer2d5-7b

Visual Question Answering • Updated Jul 22 • 17.9k • 181

dell-research-harvard/newswire

Viewer • Updated Jul 2 • 1.44M • 454 • 66

alexshengzhili/SciGraphQA-295K-train

Viewer • Updated Aug 8, 2023 • 296k • 127 • 11

xinsir/controlnet-union-sdxl-1.0

Text-to-Image • Updated Jul 30 • 82.4k • 1.11k

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Paper • 2406.19223 • Published Jun 27 • 8

laion/links_to_pocasts_lecture_and_shows_for_tts

Viewer • Updated May 29 • 331k • 11 • 8

laion/datacomp-hq

Viewer • Updated Mar 13 • 20.7M • 189 • 10

laion/Subjects-for-curricular

Viewer • Updated Dec 20, 2023 • 3.99M • 76 • 5

laion/strategic_game_maze

Viewer • Updated Oct 20, 2023 • 345M • 20.1k • 10

mlabonne/llmtwin

Viewer • Updated Aug 27 • 3.34k • 101 • 7

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Paper • 2407.07053 • Published Jul 9 • 41

dunzhang/stella_en_400M_v5

dunzhang/stella_en_1.5B_v5

RhapsodyAI/MiniCPM-V-Embedding-preview

Feature Extraction • Updated Aug 20 • 512 • 43

agentsea/wave-ui-25k

Viewer • Updated Jul 3 • 25k • 523 • 16

TencentARC/StoryStream

Preview • Updated Jul 17 • 380 • 22

apple/DCLM-7B

Updated Jul 26 • 1.65k • 824

HuggingFaceTB/smollm-corpus

Viewer • Updated Sep 6 • 237M • 12.8k • 242

HuggingFaceTB/bisac-topics

Viewer • Updated Apr 3 • 5.5k • 11 • 2

From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients

Paper • 2407.11239 • Published Jul 15 • 7

mistralai/Mistral-Nemo-Base-2407

Text Generation • Updated 10 days ago • 54k • 258

TencentARC/SEED-Story

Text-to-Image • Updated Aug 26 • 54 • 24

xlangai/BRIGHT

Viewer • Updated 13 days ago • 1.35M • 2.79k • 18

glaiveai/RAG-v1

Viewer • Updated Jun 25 • 51.4k • 388 • 62

QuietImpostor/Claude-3-Opus-Claude-3.5-Sonnnet-9k

Viewer • Updated Jun 30 • 9.94k • 79 • 16

PawanKrd/gpt-4o-200k

Viewer • Updated Jun 29 • 200k • 71 • 23

kalomaze/Opus_Instruct_3k

Viewer • Updated Jul 19 • 2.95k • 85 • 24

Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone

Paper • 2206.07643 • Published Jun 15, 2022 • 1

Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need

Paper • 2303.15256 • Published Mar 27, 2023 • 1

fireworks-ai/llama-3-firefunction-v2

Text Generation • Updated Jun 18 • 219 • 136

Stateful Memory-Augmented Transformers for Dialogue Modeling

Paper • 2209.07634 • Published Sep 15, 2022 • 1

cognitivecomputations/SystemChat-2.0

Preview • Updated May 31 • 87 • 53

CollectiveCognition/chats-data-2023-10-16

Viewer • Updated Oct 16, 2023 • 200 • 52 • 21

Izazk/Sequence-of-action-prediction-mind2web

Viewer • Updated Feb 22 • 68.9k • 63 • 3

BigAction/mind2web_clean

Viewer • Updated Apr 25 • 199 • 60 • 4

osunlp/Mind2Web

Viewer • Updated Jul 19, 2023 • 253 • 548 • 90

magicgh/MT-Mind2Web

Viewer • Updated Feb 23 • 259 • 89 • 2

TencentARC/PhotoMaker-V2

Text-to-Image • Updated Jul 22 • 19.1k • 118

KevSun/Personality_LM

Text Classification • Updated Jul 29 • 156 • 15

Running

239

♾️📚

Infinite Dataset Hub

Search and save datasets generated with a LLM in real time

chargoddard/SlimOrcaDedupCleaned-Sonnet3.5-DPO

Viewer • Updated Jul 23 • 168k • 49 • 7

nvidia/Minitron-8B-Base

Updated Aug 20 • 108 • 63

mlfoundations/MINT-1T-HTML

Viewer • Updated Sep 21 • 623M • 57.6k • 75

mlfoundations/MINT-1T-ArXiv

Viewer • Updated Sep 19 • 5.6M • 7.36k • 48

mlfoundations/MINT-1T-PDF-CC-2024-18

Updated Sep 19 • 15.2k • 19

AI-MO/NuminaMath-TIR

Viewer • Updated Jul 19 • 72.5k • 694 • 63

DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection

Paper • 2406.00856 • Published Jun 2 • 10

mlabonne/FineTome-100k

Viewer • Updated Jul 29 • 100k • 8.47k • 121

LiruiZhao/Diffree

Image-to-Image • Updated Jul 29 • 51 • 17

BAAI/bge-multilingual-gemma2

Feature Extraction • Updated Jul 31 • 89k • 130

BAAI/bge-reranker-v2.5-gemma2-lightweight

Text Classification • Updated Sep 6 • 16.8k • 42

BAAI/IndustryCorpus

Viewer • Updated Jul 23 • 595M • 6.86k • 45

jspringer/echo-mistral-7b-instruct-lasttoken

Feature Extraction • Updated Feb 26 • 418 • 5

BAAI/bge-en-icl

Feature Extraction • Updated Sep 25 • 25.2k • 96

AlekseyKorshuk/full_user_edit_responses-clean

Viewer • Updated Mar 30, 2023 • 364k • 34 • 1

m-a-p/MMRA

Viewer • Updated Jul 31 • 1.02k • 138 • 13

m-a-p/II-Bench

Viewer • Updated Jun 29 • 1.43k • 755 • 8

BEE-spoke-data/fineweb-1000_64k

Viewer • Updated Jun 23 • 2k • 56 • 3

Salesforce/xgen-mm-phi3-mini-instruct-r-v1

Image-Text-to-Text • Updated Sep 18 • 50k • 184

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16 • 1.35M • • 6.4k

black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16 • 1.95M • • 2.84k

numind/NuExtract

Text Generation • Updated 30 days ago • 2.1k • 203

numind/NuSentiment-multilingual

Feature Extraction • Updated Jan 26 • 153 • 10

HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • Updated Sep 18 • 11.8k • 237

aipicasso/megalith-10m-florence2

Viewer • Updated Jul 31 • 9.14M • 120 • 22

ZhengPeng7/BiRefNet

Image Segmentation • Updated 8 days ago • 709k • 241

nvidia/quality-classifier-deberta

Updated Aug 6 • 1.96k • 45

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Paper • 2408.04093 • Published Aug 7 • 4

tiiuae/falcon-mamba-7b-4bit

Text Generation • Updated Oct 10 • 177 • 11

nisten/all-human-diseases

Viewer • Updated Aug 19 • 2.2k • 104 • 101

THUDM/LongWriter-6k

Viewer • Updated Aug 14 • 6k • 296 • 169

anthracite-org/Stheno-Data-Filtered

Viewer • Updated Aug 18 • 31.1k • 22 • 14

anthracite-org/kalo-opus-instruct-22k-no-refusal

Viewer • Updated Aug 13 • 22.3k • 172 • 14

anthracite-org/nopm_claude_writing_fixed

Viewer • Updated Aug 18 • 6.35k • 116 • 7

microsoft/Phi-3.5-vision-instruct

Image-Text-to-Text • Updated Sep 26 • 988k • 563

microsoft/Phi-3.5-MoE-instruct

Text Generation • Updated 23 days ago • 44k • 516

fal/AuraFace-v1

Updated Aug 26 • 68

NexaAIDev/Squid

Updated Sep 3 • 42 • 30

Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Paper • 2408.15518 • Published Aug 28 • 42

HuggingFaceTB/everyday-conversations-llama3.1-2k

Viewer • Updated Aug 17 • 2.38k • 623 • 77

NousResearch/hermes-function-calling-v1

Viewer • Updated Aug 30 • 11.6k • 588 • 215

multimodalart/product-design

Text-to-Image • Updated Sep 22 • 1.92k • • 29

novateur/WavTokenizer

Text-to-Speech • Updated Sep 27 • 44

facebook/sapiens

Updated Sep 20 • 532 • 217

Shakker-Labs/AWPortrait-FL

Text-to-Image • Updated Sep 5 • 30.8k • 390

sequelbox/Supernova

Viewer • Updated Sep 27 • 178k • 211 • 8

Running

513

🖼💬

Vision Arena (Testing VLMs side-by-side)

mattshumer/Reflection-Llama-3.1-70B

Text Generation • Updated Sep 24 • 2.46k • 1.7k

deepseek-ai/DeepSeek-V2.5

Text Generation • Updated Oct 8 • 10.8k • 589

deepseek-ai/ESFT-vanilla-lite

Text Generation • Updated Jul 23 • 120 • 8

yifeihu/TB-OCR-preview-0.1

Image-Text-to-Text • Updated Sep 6 • 1.99k • 124

gabrielmbmb/distilabel-reflection-tuning

Viewer • Updated Sep 6 • 5 • 81 • 55

TencentARC/Open-MAGVIT2

Image Feature Extraction • Updated Sep 9 • 10

openbmb/MiniCPM3-4B

Text Generation • Updated 2 days ago • 29k • 381

THUDM/LongCite-glm4-9b

Text Generation • Updated Sep 13 • 420 • 26

jinaai/reader-lm-1.5b

Text Generation • Updated Sep 20 • 7.69k • 476

Vchitect/Vchitect-2.0-2B

Text-to-Video • Updated Sep 15 • 41 • 34

tencent/DepthCrafter

Depth Estimation • Updated Sep 24 • 306k • 64

mistralai/Pixtral-12B-2409

Updated 23 days ago • 489

stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • Updated Sep 18 • 385k • 1.19k

StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation

Paper • 2409.12576 • Published Sep 19 • 15

THUdyh/Oryx-7B

Text Generation • Updated Sep 25 • 274 • 11

THUdyh/Oryx-7B-Image

Text Generation • Updated Sep 23 • 13 • 3

THUdyh/Oryx-ViT

Image Classification • Updated Sep 23 • 5

BAAI/SegGPT

Updated Apr 21, 2023 • 17

Salesforce/fineweb_deduplicated

Viewer • Updated Sep 14 • 6.43B • 2k • 27

KbsdJames/Omni-MATH

Viewer • Updated Oct 12 • 4.43k • 529 • 57

BAAI/Emu3-Gen

Any-to-Any • Updated 24 days ago • 15.4k • 186

CultriX/elitebabes-flux

Text-to-Image • Updated Sep 20 • 1.39k • • 13

RED-AIGC/StoryMaker

Text-to-Image • Updated 7 days ago • 621 • 70

google/frames-benchmark

Viewer • Updated Oct 15 • 824 • 1.68k • 164

Anthropic/discrim-eval

Viewer • Updated Jan 5 • 18.9k • 663 • 43

facebook/sam2.1-hiera-large

Mask Generation • Updated Sep 24 • 14k • 38

Zyphra/Zamba2-2.7B-instruct

Text Generation • Updated 29 days ago • 3.69k • 76

princeton-nlp/Llama-3-8B-ProLong-512k-Instruct

Updated 16 days ago • 9.41k • 17

jxm/cde-small-v1

Feature Extraction • Updated 17 days ago • 13.5k • 267

PrincetonPLI/Instruct-SkillMix-SDD

Viewer • Updated Sep 9 • 8k • 68 • 4

THUDM/cogvlm2-llama3-caption

Video-Text-to-Text • Updated Sep 26 • 4.53k • 57

julien040/hacker-news-posts

Viewer • Updated Jun 6, 2023 • 4.01M • 79 • 5

princeton-nlp/Llama-3-8B-ProLong-512k-Base

Updated 16 days ago • 87 • 5

LLM360/TxT360

Preview • Updated 8 days ago • 384k • 211

bingbangboom/flux-waterscape

Text-to-Image • Updated Oct 10 • 1.54k • • 13

facebook/Self-taught-evaluator-DPO-data

Viewer • Updated Sep 30 • 57.5k • 100 • 30

facebook/layerskip-llama2-13B

Text Generation • Updated 28 days ago • 71 • 5

ibm-granite/granite-8b-code-instruct-accelerator

Updated May 29 • 30 • 1

peakji/steiner-32b-preview

Updated 26 days ago • 98 • 39

CohereForAI/aya-expanse-32b

Text Generation • Updated 15 days ago • 26.8k • 167

CohereForAI/aya-expanse-8b

Text Generation • Updated 17 days ago • 39.8k • 274

Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training

Paper • 2410.08202 • Published Oct 10 • 1

McGill-NLP/FaithDial

Viewer • Updated Feb 5, 2023 • 32.3k • 194 • 17

relaxml/Llama-3.1-8b-Instruct-QTIP-4Bit

Updated 19 days ago • 69 • 2

Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces

Paper • 2410.09918 • Published Oct 13 • 3

GAIR/o1-journey

Viewer • Updated Oct 16 • 327 • 1.08k • 73

marcelbinz/Psych-101

Viewer • Updated 14 days ago • 60.1k • 263 • 34

nvidia/Nemotron-4-Mini-Hindi-4B-Base

Updated 24 days ago • 27 • 9

nvidia/Nemotron-4-Mini-Hindi-4B-Instruct

Updated 1 day ago • 40 • 8

Etched/oasis-500m

Updated 12 days ago • 3.72k • 396

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • Updated 11 days ago • 59.6k • • 331

tencent/Tencent-Hunyuan-Large

Text Generation • Updated 9 days ago • 118 • 455

THUDM/webrl-llama-3.1-8b

Updated 10 days ago • 37 • 2

THUDM/webrl-glm-4-9b

Updated 11 days ago • 38 • 6

hbseong/HarmAug-Guard

Text Classification • Updated Oct 14 • 232 • 25

BAAI/IndustryCorpus2

Viewer • Updated about 9 hours ago • 826M • 12.8k • 33

qq8933/OpenLongCoT-Pretrain

Viewer • Updated 19 days ago • 103k • 453 • 74

microsoft/maira-2

Text Generation • Updated 26 days ago • 1.41k • 32

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Paper • 2411.04997 • Published 9 days ago • 29

microsoft/orca-agentinstruct-1M-v1

Viewer • Updated 15 days ago • 1.05M • 245 • 75

Nexusflow/Athene-V2-Chat

Text Generation • Updated 1 day ago • 335 • 64

Nexusflow/Athene-V2-Agent

Text Generation • Updated 1 day ago • 202 • 37

numind/NuExtract-1.5-tiny

Text Generation • Updated 2 days ago • 1.3k • 9