Eval Datasets - a automated-research-group Collection

Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

automated-research-group 's Collections

Models

Eval Datasets

updated Jan 11

openai/openai_humaneval

Viewer • Updated Jan 4 • 164 • 150k • 247
google-research-datasets/mbpp

Viewer • Updated Jan 4 • 1.4k • 156k • 143
ybisk/piqa

Updated Jan 18 • 116k • 85
lighteval/siqa

Viewer • Updated Oct 7, 2023 • 35.4k • 5.22k • 4
Rowan/hellaswag

Viewer • Updated Sep 28, 2023 • 60k • 99.2k • 94
allenai/winogrande

Updated Jan 18 • 84.2k • 57
allenai/ai2_arc

Viewer • Updated Dec 21, 2023 • 7.79k • 127k • 142
allenai/openbookqa

Viewer • Updated Jan 4 • 11.9k • 36.2k • 79
tau/commonsense_qa

Viewer • Updated Jan 4 • 12.1k • 15.8k • 74
google-research-datasets/natural_questions

Viewer • Updated Mar 11 • 26.3k • 6.45k • 85
mandarjoshi/trivia_qa

Viewer • Updated Jan 5 • 848k • 78.5k • 95
rajpurkar/squad

Viewer • Updated Mar 4 • 98.2k • 54.5k • 264
allenai/quac

Updated Jan 18 • 510 • 28
google/boolq

Viewer • Updated Jan 22 • 12.7k • 8.41k • 62
openai/gsm8k

Viewer • Updated Jan 4 • 17.6k • 204k • 413
hendrycks/competition_math

Updated Jun 8, 2023 • 25.3k • 126
cais/mmlu

Viewer • Updated Mar 8 • 231k • 68.6k • 322
maveriq/bigbenchhard

Viewer • Updated Sep 29, 2023 • 6.51k • 1.66k • 17
baber/agieval

Updated Oct 26, 2023 • 80 • 4

Collection guide
Browse collections

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs