Nexesenex
/

MIstral-QUantized-70b_Miqu-1-70b-iMat.GGUF

Model card Files Files and versions Community

Nexesenex commited on Feb 3

Commit

afcc6ac

•

1 Parent(s): 86671a1

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -14,7 +14,7 @@ Miqudev provided Q5_K_M, Q4_K_M, and Q2_K on this page : https://huggingface.co/
 Here, you will find the following quants :
 Full offload possible on 48GB VRAM with a huge context size :
-- Q4_K_S
 - Lower quality : Q3_K_L
 Full offload possible on 36GB VRAM with a variable context size (up to 7168 with Q3_K_M, for example)

 Here, you will find the following quants :
 Full offload possible on 48GB VRAM with a huge context size :
+- Q4_K_S. Note : A Q5_K_S requant compared to the original Q4_K_M quant of Miqudev wouldn't bring much benefit if any, and take much more VRAM, so I didn't do it.
 - Lower quality : Q3_K_L
 Full offload possible on 36GB VRAM with a variable context size (up to 7168 with Q3_K_M, for example)