automated-research-group's Collections
Eval Datasets
Updated Jan 11
openai/openai_humaneval • Viewer • Updated Jan 4 • 164 rows • 150k downloads • 247 likes
google-research-datasets/mbpp • Viewer • Updated Jan 4 • 1.4k rows • 156k downloads • 143 likes
ybisk/piqa • Updated Jan 18 • 116k downloads • 85 likes
lighteval/siqa • Viewer • Updated Oct 7, 2023 • 35.4k rows • 5.22k downloads • 4 likes
Rowan/hellaswag • Viewer • Updated Sep 28, 2023 • 60k rows • 99.2k downloads • 94 likes
allenai/winogrande • Updated Jan 18 • 84.2k downloads • 57 likes
allenai/ai2_arc • Viewer • Updated Dec 21, 2023 • 7.79k rows • 127k downloads • 142 likes
allenai/openbookqa • Viewer • Updated Jan 4 • 11.9k rows • 36.2k downloads • 79 likes
tau/commonsense_qa • Viewer • Updated Jan 4 • 12.1k rows • 15.8k downloads • 74 likes
google-research-datasets/natural_questions • Viewer • Updated Mar 11 • 26.3k rows • 6.45k downloads • 85 likes
mandarjoshi/trivia_qa • Viewer • Updated Jan 5 • 848k rows • 78.5k downloads • 95 likes
rajpurkar/squad • Viewer • Updated Mar 4 • 98.2k rows • 54.5k downloads • 264 likes
allenai/quac • Updated Jan 18 • 510 downloads • 28 likes
google/boolq • Viewer • Updated Jan 22 • 12.7k rows • 8.41k downloads • 62 likes
openai/gsm8k • Viewer • Updated Jan 4 • 17.6k rows • 204k downloads • 413 likes
hendrycks/competition_math • Updated Jun 8, 2023 • 25.3k downloads • 126 likes
cais/mmlu • Viewer • Updated Mar 8 • 231k rows • 68.6k downloads • 322 likes
maveriq/bigbenchhard • Viewer • Updated Sep 29, 2023 • 6.51k rows • 1.66k downloads • 17 likes
baber/agieval • Updated Oct 26, 2023 • 80 downloads • 4 likes