Adding Evaluation Results

#13
Files changed (1) hide show
  1. README.md +14 -1
README.md CHANGED
@@ -102,4 +102,17 @@ To cite this model, use
102
  journal={arXiv preprint arXiv:2101.00027},
103
  year={2020}
104
  }
105
- ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
102
  journal={arXiv preprint arXiv:2101.00027},
103
  year={2020}
104
  }
105
+ ```
106
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
107
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_EleutherAI__gpt-neo-2.7B)
108
+
109
+ | Metric | Value |
110
+ |-----------------------|---------------------------|
111
+ | Avg. | 31.71 |
112
+ | ARC (25-shot) | 33.36 |
113
+ | HellaSwag (10-shot) | 56.24 |
114
+ | MMLU (5-shot) | 26.45 |
115
+ | TruthfulQA (0-shot) | 39.78 |
116
+ | Winogrande (5-shot) | 60.06 |
117
+ | GSM8K (5-shot) | 1.29 |
118
+ | DROP (3-shot) | 4.77 |