Update README.md
Browse files
README.md
CHANGED
@@ -28,19 +28,19 @@ The adapter was trained via SFT on random subsets of the following:
|
|
28 |
## Performance
|
29 |
| Models | Llama2-7B (fp16)| Llama2-7B (HQQ-1bit)| Llama2-7B (HQQ+-1bit)| Quip# (2bit)|
|
30 |
|-------------------|------------------|------------------|------------------|------------------|
|
31 |
-
| Wiki Perpexlity |
|
32 |
-
| VRAM (GB) | 13.5 |
|
33 |
-
| forward time (sec)| <b>0.1<b> | 0.231 | 0.257
|
34 |
|
35 |
| Models | Llama2-7B-chat (fp16)| Llama2-7B-chat (HQQ-1bit)| Llama2-7B-chat (HQQ+-1bit)|
|
36 |
|-------------------|------------------|------------------|------------------|
|
37 |
-
| ARC (25-shot) | 53.67 |
|
38 |
| HellaSwag (10-shot)| 78.56 | 25.66 | 52.96 |
|
39 |
-
| MMLU (5-shot) | 48.16 |
|
40 |
-
| TruthfulQA-MC2 | 45.32 |
|
41 |
-
| Winogrande (5-shot)| 72.53 |
|
42 |
-
| GSM8K (5-shot) | 23.12 |
|
43 |
-
| Average | 53.56 |
|
44 |
|
45 |
## Usage
|
46 |
First, install the latest version of <a href="https://github.com/mobiusml/hqq/">HQQ</a>:
|
|
|
28 |
## Performance
|
29 |
| Models | Llama2-7B (fp16)| Llama2-7B (HQQ-1bit)| Llama2-7B (HQQ+-1bit)| Quip# (2bit)|
|
30 |
|-------------------|------------------|------------------|------------------|------------------|
|
31 |
+
| Wiki Perpexlity | 5.18 | 9866 | <b>8.53</b> | 8.54 |
|
32 |
+
| VRAM (GB) | 13.5 | <b>1.76</b> | 1.85 | 2.72 |
|
33 |
+
| forward time (sec)| <b>0.1<b> | 0.231 | 0.257 | 0.353 |
|
34 |
|
35 |
| Models | Llama2-7B-chat (fp16)| Llama2-7B-chat (HQQ-1bit)| Llama2-7B-chat (HQQ+-1bit)|
|
36 |
|-------------------|------------------|------------------|------------------|
|
37 |
+
| ARC (25-shot) | 53.67 | 21.59 | 31.14 |
|
38 |
| HellaSwag (10-shot)| 78.56 | 25.66 | 52.96 |
|
39 |
+
| MMLU (5-shot) | 48.16 | 25.08 | 26.54 |
|
40 |
+
| TruthfulQA-MC2 | 45.32 | 47.81 | 43.16 |
|
41 |
+
| Winogrande (5-shot)| 72.53 | 49.72 | 60.54 |
|
42 |
+
| GSM8K (5-shot) | 23.12 | 0 | 11 |
|
43 |
+
| Average | 53.56 | 28.31 | 37.56 |
|
44 |
|
45 |
## Usage
|
46 |
First, install the latest version of <a href="https://github.com/mobiusml/hqq/">HQQ</a>:
|