stabilityai/stablelm-zephyr-3b


pvduy committed on Dec 2, 2023

Commit 3131f94 • 1 Parent(s): 0cd93b5

Update README.md

Files changed (1)

README.md +3 -3

@@ -99,7 +99,7 @@ The dataset is comprised of a mixture of open datasets large-scale datasets avai
 | GPT-4 |  -| RLHF |8.99| 95.28|
 ## Other benchmark:
-1. HuggingFace OpenLLM Leaderboard
+1. **HuggingFace OpenLLM Leaderboard**
 | Metric                | Value                     |
 |-----------------------|---------------------------|
 | ARC (25-shot)         |  47.0       |
@@ -110,7 +110,7 @@ The dataset is comprised of a mixture of open datasets large-scale datasets avai
 | GSM8K (5-shot)        | 42.3        |
-2. BigBench:
+2. **BigBench**:
 - Average: 35.26
 - Details:
@@ -139,7 +139,7 @@ The dataset is comprised of a mixture of open datasets large-scale datasets avai
 | bigbench_tracking_shuffled_objects_seven_objects    | 0       | multiple_choice_grade   | 0.1856| 0.0110 |
 | bigbench_tracking_shuffled_objects_three_objects    | 0       | multiple_choice_grade   | 0.1269| 0.0080 |
-3. AGI:
+3. **AGI Benchmark**:
 - Average: 33.23
 - Details:
 |             Task             |Version| Metric |Value |   |Stderr|