Upload folder using huggingface_hub
Browse files
README.md
CHANGED
@@ -12,6 +12,23 @@ tags:
|
|
12 |
base_model:
|
13 |
- prince-canuma/Llama-3-6B-v0
|
14 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
15 |
|
16 |
# Model Summary
|
17 |
<img src="images/llama-3-6B icon.jpeg" width="500" alt="Llama-3-6B"/>
|
|
|
12 |
base_model:
|
13 |
- prince-canuma/Llama-3-6B-v0
|
14 |
---
|
15 |
+
**Exllamav2** quant (**exl2** / **3.5 bpw**) made with ExLlamaV2 v0.0.21
|
16 |
+
|
17 |
+
Other EXL2 quants:
|
18 |
+
| **Quant** | **Model Size** | **lm_head** |
|
19 |
+
| ----- | ---------- | ------- |
|
20 |
+
|<center>**[2.2](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-2_2bpw_exl2)**</center> | <center>2787 MB</center> | <center>6</center> |
|
21 |
+
|<center>**[2.5](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-2_5bpw_exl2)**</center> | <center>2959 MB</center> | <center>6</center> |
|
22 |
+
|<center>**[3.0](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-3_0bpw_exl2)**</center> | <center>3259 MB</center> | <center>6</center> |
|
23 |
+
|<center>**[3.5](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-3_5bpw_exl2)**</center> | <center>3583 MB</center> | <center>6</center> |
|
24 |
+
|<center>**[3.75](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-3_75bpw_exl2)**</center> | <center>3739 MB</center> | <center>6</center> |
|
25 |
+
|<center>**[4.0](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-4_0bpw_exl2)**</center> | <center>3895 MB</center> | <center>6</center> |
|
26 |
+
|<center>**[4.25](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-4_25bpw_exl2)**</center> | <center>4051 MB</center> | <center>6</center> |
|
27 |
+
|<center>**[5.0](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-5_0bpw_exl2)**</center> | <center>4519 MB</center> | <center>6</center> |
|
28 |
+
|<center>**[6.0](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-6_0bpw_exl2)**</center> | <center>5247 MB</center> | <center>8</center> |
|
29 |
+
|<center>**[6.5](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-6_5bpw_exl2)**</center> | <center>5548 MB</center> | <center>8</center> |
|
30 |
+
|<center>**[8.0](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-8_0bpw_exl2)**</center> | <center>6436 MB</center> | <center>8</center> |
|
31 |
+
|
32 |
|
33 |
# Model Summary
|
34 |
<img src="images/llama-3-6B icon.jpeg" width="500" alt="Llama-3-6B"/>
|