diwank
's Collections
Preview
•
Updated
•
342
•
75
argilla/intel-orca-dpo-pairs-helm-instruct
Viewer
•
Updated
•
5
•
47
•
1
argilla/OpenHermes2.5-dpo-binarized-alpha
Viewer
•
Updated
•
9.79k
•
71
•
64
argilla/ultrafeedback-critique
Viewer
•
Updated
•
253k
•
58
•
4
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
•
Updated
•
60.9k
•
7k
•
124
ai2lumos/lumos_maths_plan_onetime
Viewer
•
Updated
•
19.8k
•
53
•
2
ai2lumos/lumos_unified_plan_iterative
Viewer
•
Updated
•
55.4k
•
50
•
2
ai2lumos/lumos_complex_qa_plan_onetime
Viewer
•
Updated
•
19.4k
•
56
•
3
Viewer
•
Updated
•
10k
•
226
•
28
lmsys/mt_bench_human_judgments
Viewer
•
Updated
•
5.76k
•
384
•
112
lmsys/chatbot_arena_conversations
Viewer
•
Updated
•
33k
•
567
•
337
Nexusflow/Starling-LM-7B-beta
Text Generation
•
Updated
•
6.22k
•
341
Qwen/Qwen1.5-32B
Text Generation
•
Updated
•
13k
•
81
vicgalle/configurable-system-prompt-multitask
Viewer
•
Updated
•
1.95k
•
165
•
19
paraloq/json_data_extraction
Viewer
•
Updated
•
484
•
69
•
16
Viewer
•
Updated
•
479
•
59
•
4
iamtarun/python_code_instructions_18k_alpaca
Viewer
•
Updated
•
18.6k
•
2.05k
•
228
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Paper
•
2403.15042
•
Published
•
25
Viewer
•
Updated
•
2.35k
•
39
•
1
Paper
•
2402.12219
•
Published
•
15
Viewer
•
Updated
•
20.2k
•
88
•
30
M4-ai/prm_dpo_pairs_cleaned
Viewer
•
Updated
•
7.99k
•
57
•
11
SanjiWatsuki/Kunoichi-DPO-v2-7B
Text Generation
•
Updated
•
886
•
82
Viewer
•
Updated
•
17.3k
•
511
•
20
mlabonne/orpo-dpo-mix-40k
Viewer
•
Updated
•
44.2k
•
1.47k
•
242
Viewer
•
Updated
•
529k
•
1.28k
•
119
meta-llama/Meta-Llama-3-8B
Text Generation
•
Updated
•
671k
•
5.81k
Viewer
•
Updated
•
149k
•
80
•
7
FreedomIntelligence/evol-instruct-hindi
Viewer
•
Updated
•
59k
•
9
•
2
totally-not-an-llm/EverythingLM-data-V3
Viewer
•
Updated
•
1.07k
•
60
•
31
RUCAIBox/Story-Generation
Updated
•
69
•
11
imone/Llama-3-8B-fixed-special-embedding
Text Generation
•
Updated
•
1.02k
•
15
Viewer
•
Updated
•
49.6k
•
390
•
108
Norquinal/claude_multiround_chat_30k
Viewer
•
Updated
•
32.2k
•
44
•
51
Norquinal/claude_multi_instruct_30k
Viewer
•
Updated
•
32.2k
•
20
•
10
Viewer
•
Updated
•
1.72M
•
35
•
9
Locutusque/OpenCerebrum-2.0-SFT
Viewer
•
Updated
•
6.4k
•
51
•
4
Locutusque/OpenCerebrum-2.0-DPO
Viewer
•
Updated
•
720
•
45
•
4
Preview
•
Updated
•
276
•
12
Preview
•
Updated
•
91
•
26
gradientai/Llama-3-70B-Instruct-Gradient-262k
Text Generation
•
Updated
•
222
•
55
princeton-nlp/QuRating-GPT3.5-Judgments
Viewer
•
Updated
•
250k
•
42
•
5
Viewer
•
Updated
•
1.46M
•
39
•
15
mustafaaljadery/gemma-2B-10M
jondurbin/airoboros-70b-3.3
Text Generation
•
Updated
•
2.55k
•
14
princeton-nlp/Llama-3-Instruct-8B-SimPO
Text Generation
•
Updated
•
2.82k
•
55
Viewer
•
Updated
•
21.4k
•
14.4k
•
362
nvidia/Nemotron-4-340B-Reward
Updated
•
360
•
109
Magpie-Align/Magpie-Pro-MT-300K-v0.1
Viewer
•
Updated
•
300k
•
526
•
28
Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.1
Text Generation
•
Updated
•
2.59k
•
4
nvidia/Aegis-AI-Content-Safety-Dataset-1.0
Viewer
•
Updated
•
12k
•
966
•
44
Salesforce/xlam-function-calling-60k
Viewer
•
Updated
•
60k
•
2.41k
•
383
Viewer
•
Updated
•
20.4M
•
7.82k
•
542
diwank/llmlingua-compressed-text
Viewer
•
Updated
•
222k
•
44
•
2
diwank/python-code-execution-output
Viewer
•
Updated
•
3.61k
•
44
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on
Mobile Devices
Paper
•
2406.08451
•
Published
•
23
Viewer
•
Updated
•
99.5k
•
357
•
18
cognitivecomputations/samantha-1.5
Viewer
•
Updated
•
327
•
51
•
11
Viewer
•
Updated
•
728
•
50
•
8
HannahRoseKirk/prism-alignment
Viewer
•
Updated
•
77.9k
•
1.08k
•
60
RLHFlow/ArmoRM-Llama3-8B-v0.1
Text Classification
•
Updated
•
13.1k
•
146
sfairXC/FsfairX-LLaMA3-RM-v0.1
Text Classification
•
Updated
•
19.1k
•
48
PKU-Alignment/PKU-SafeRLHF-30K
Viewer
•
Updated
•
29.9k
•
263
•
8
instruction-pretrain/ft-instruction-synthesizer-collection
Viewer
•
Updated
•
249k
•
264
•
58
Viewer
•
Updated
•
11.1M
•
671
•
53
Viewer
•
Updated
•
68.8k
•
101k
•
21
Viewer
•
Updated
•
12.7k
•
19
•
5
imbue/human_question_quality_judgments
Viewer
•
Updated
•
167k
•
40
•
8
Viewer
•
Updated
•
54k
•
55
•
19
imbue/high_quality_public_evaluations
Viewer
•
Updated
•
12.8k
•
40
•
6
imbue/high_quality_private_evaluations
Viewer
•
Updated
•
10.6k
•
134
•
8
google/gemma-2-27b
Text Generation
•
Updated
•
19.7k
•
176
Viewer
•
Updated
•
1.46M
•
2.75k
•
4
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3
Text Generation
•
Updated
•
6.84k
•
77
Viewer
•
Updated
•
375k
•
5.75k
•
451
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper
•
2406.20094
•
Published
•
94
Viewer
•
Updated
•
1.24M
•
118
•
7
Viewer
•
Updated
•
1.25M
•
177
•
5
Viewer
•
Updated
•
2.05M
•
129
•
3
Viewer
•
Updated
•
326k
•
15
•
8
hubertsiuzdak/snac_24khz
Updated
•
13.6k
•
17
hubertsiuzdak/snac_32khz
hubertsiuzdak/snac_44khz
Updated
•
1.12k
•
7
facebook/chameleon-30b
Image-Text-to-Text
•
Updated
•
670
•
82
facebook/chameleon-7b
Image-Text-to-Text
•
Updated
•
18.7k
•
164
gokaygokay/random_instruct_docci
Viewer
•
Updated
•
14.6k
•
108
•
5
internlm/internlm2_5-7b
Text Generation
•
Updated
•
4.79k
•
15
Gryphe/Opus-WritingPrompts
Viewer
•
Updated
•
14.9k
•
678
•
31
Viewer
•
Updated
•
3k
•
97
•
9
Are You Sure? Rank Them Again: Repeated Ranking For Better Preference
Datasets
Paper
•
2405.18952
•
Published
•
10
OpenGVLab/InternVL2-4B
Image-Text-to-Text
•
Updated
•
92.3k
•
42
OpenGVLab/InternVL2-Llama3-76B
Image-Text-to-Text
•
Updated
•
214k
•
203
QuasarResearch/apollo-preview-v0.2
Viewer
•
Updated
•
51.4k
•
427
•
62
fireworks-ai/nexus_parallel_messages
Viewer
•
Updated
•
70
•
39
•
6
fireworks-ai/nexus_parallel_functions
Viewer
•
Updated
•
29
•
40
•
4
Viewer
•
Updated
•
539
•
46
•
22
Viewer
•
Updated
•
18.6k
•
214
•
7
Viewer
•
Updated
•
259
•
82
•
2
Viewer
•
Updated
•
486k
•
90
•
38
Viewer
•
Updated
•
1.75M
•
255
•
78
Viewer
•
Updated
•
860k
•
2.93k
•
205
Gryphe/Sonnet3.5-SlimOrcaDedupCleaned
Viewer
•
Updated
•
181k
•
127
•
76
chargoddard/WebInstructSub-prometheus
Viewer
•
Updated
•
2.39M
•
178
•
16
Viewer
•
Updated
•
1.96k
•
50
•
29
Viewer
•
Updated
•
294k
•
101
•
24
chargoddard/chai-feedback-pairs
Viewer
•
Updated
•
30.1k
•
33
•
5
nayohan/multi_session_chat
Viewer
•
Updated
•
23.4k
•
173
•
1
nvidia/Mistral-NeMo-12B-Instruct
Updated
•
177
•
138
nvidia/Mistral-NeMo-12B-Base
meta-llama/Llama-3.1-8B
Text Generation
•
Updated
•
1.11M
•
1.05k
meta-llama/Prompt-Guard-86M
Text Classification
•
Updated
•
128k
•
189
Viewer
•
Updated
•
6.41k
•
96
•
29
mistralai/Mistral-Large-Instruct-2407
Updated
•
27.7k
•
798
Symbol-LLM/Symbolic_Collection
Viewer
•
Updated
•
975k
•
83
•
7
Viewer
•
Updated
•
100k
•
9.08k
•
116
roborovski/dolly-entity-extraction
Viewer
•
Updated
•
5.95k
•
171
•
2
kalomaze/Opus_Instruct_25k
Viewer
•
Updated
•
25.1k
•
74
•
31
Vezora/Code-Preference-Pairs
Viewer
•
Updated
•
54k
•
74
•
14
Nexusflow/Athene-70B
Text Generation
•
Updated
•
8.59k
•
187
arcee-ai/Arcee-Spark
Text Generation
•
Updated
•
3.12k
•
86
Viewer
•
Updated
•
270k
•
74
•
7
OpenBuddy/openbuddy-llama3.1-8b-v22.2-131k
Text Generation
•
Updated
•
249
•
2
google/gemma-2-2b
Text Generation
•
Updated
•
12.4M
•
410
google/gemma-scope
google/shieldgemma-2b
Text Generation
•
Updated
•
4.58k
•
46
Viewer
•
Updated
•
11.2k
•
51
•
6
argilla/magpie-ultra-v0.1
Viewer
•
Updated
•
50k
•
423
•
210
mlabonne/Llama-3.1-70B-Instruct-lorablated-GGUF
Updated
•
3.35k
•
36
Viewer
•
Updated
•
55.1k
•
129
•
88
internlm/internlm2_5-20b
Text Generation
•
Updated
•
333
•
16
Viewer
•
Updated
•
1.02k
•
123
•
13
Viewer
•
Updated
•
2.39M
•
80
•
8
Viewer
•
Updated
•
6k
•
351
•
169
Viewer
•
Updated
•
282
•
41
•
1
Gryphe/Sonnet3.5-Charcard-Roleplay
Updated
•
317
•
37
NousResearch/hermes-function-calling-v1
Viewer
•
Updated
•
11.6k
•
536
•
209
AlgorithmicResearchGroup/ArXivDLInstruct
Viewer
•
Updated
•
778k
•
162
•
13
upstage/solar-pro-preview-instruct
Text Generation
•
Updated
•
1.47k
•
420
mistral-community/pixtral-12b-240910
Image-Text-to-Text
•
Updated
•
382
arcee-ai/Llama-3.1-SuperNova-Lite
Text Generation
•
Updated
•
9.86k
•
171
Skywork/Skywork-Reward-Gemma-2-27B
Text Classification
•
Updated
•
221k
•
36
Viewer
•
Updated
•
59.4k
•
213
•
61
Viewer
•
Updated
•
29.9k
•
187
•
57
argilla/FinePersonas-v0.1
Viewer
•
Updated
•
21.1M
•
4.48k
•
317
Training Language Models to Self-Correct via Reinforcement Learning
Paper
•
2409.12917
•
Published
•
134
bespokelabs/Bespoke-MiniCheck-7B
Text Classification
•
Updated
•
7.93k
•
46
Viewer
•
Updated
•
13.6k
•
117
•
19
mlabonne/open-perfectblend
Viewer
•
Updated
•
1.42M
•
953
•
43
rombodawg/Everything_Instruct
Viewer
•
Updated
•
4.05M
•
4.08k
•
41
Viewer
•
Updated
•
290k
•
672
•
24