Zoyd commited on
Commit
c7e3c4f
1 Parent(s): 9453182

Upload folder using huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +17 -0
README.md CHANGED
@@ -12,6 +12,23 @@ tags:
12
  base_model:
13
  - prince-canuma/Llama-3-6B-v0
14
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
15
 
16
  # Model Summary
17
  <img src="images/llama-3-6B icon.jpeg" width="500" alt="Llama-3-6B"/>
 
12
  base_model:
13
  - prince-canuma/Llama-3-6B-v0
14
  ---
15
+ **Exllamav2** quant (**exl2** / **3.5 bpw**) made with ExLlamaV2 v0.0.21
16
+
17
+ Other EXL2 quants:
18
+ | **Quant** | **Model Size** | **lm_head** |
19
+ | ----- | ---------- | ------- |
20
+ |<center>**[2.2](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-2_2bpw_exl2)**</center> | <center>2787 MB</center> | <center>6</center> |
21
+ |<center>**[2.5](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-2_5bpw_exl2)**</center> | <center>2959 MB</center> | <center>6</center> |
22
+ |<center>**[3.0](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-3_0bpw_exl2)**</center> | <center>3259 MB</center> | <center>6</center> |
23
+ |<center>**[3.5](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-3_5bpw_exl2)**</center> | <center>3583 MB</center> | <center>6</center> |
24
+ |<center>**[3.75](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-3_75bpw_exl2)**</center> | <center>3739 MB</center> | <center>6</center> |
25
+ |<center>**[4.0](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-4_0bpw_exl2)**</center> | <center>3895 MB</center> | <center>6</center> |
26
+ |<center>**[4.25](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-4_25bpw_exl2)**</center> | <center>4051 MB</center> | <center>6</center> |
27
+ |<center>**[5.0](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-5_0bpw_exl2)**</center> | <center>4519 MB</center> | <center>6</center> |
28
+ |<center>**[6.0](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-6_0bpw_exl2)**</center> | <center>5247 MB</center> | <center>8</center> |
29
+ |<center>**[6.5](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-6_5bpw_exl2)**</center> | <center>5548 MB</center> | <center>8</center> |
30
+ |<center>**[8.0](https://huggingface.co/Zoyd/prince-canuma_Llama-3-6B-v0.1-8_0bpw_exl2)**</center> | <center>6436 MB</center> | <center>8</center> |
31
+
32
 
33
  # Model Summary
34
  <img src="images/llama-3-6B icon.jpeg" width="500" alt="Llama-3-6B"/>