koesn
/

NeuralMaxime-7B-slerp-GGUF

mlabonne/AlphaMonarch-7B

mlabonne/NeuralMonarch-7B

Inference Endpoints

Model card Files Files and versions Community

koesn commited on Mar 1

Commit

353a172

•

1 Parent(s): a997dcb

Update README.md

Files changed (1) hide show

README.md +11 -10

README.md CHANGED Viewed

@@ -12,16 +12,17 @@ This repo contains GGUF format model files for NeuralMaxime-7B-slerp-GGUF.
 ## Files Provided
-|                Name               | Quant  | Bits | File Size |              Remark              |
-| --------------------------------- | ------ | ---- | --------- | -------------------------------- |
-| neuralmaxime-7b-slerp.IQ3_S.gguf  | IQ3_S  |  3   |  3.18 GB  | 3.44 bpw quantization            |
-| neuralmaxime-7b-slerp.IQ3_M.gguf  | IQ3_M  |  3   |  3.28 GB  | 3.66 bpw quantization mix        |
-| neuralmaxime-7b-slerp.Q4_0.gguf   | Q4_0   |  4   |  4.11 GB  | 3.56G, +0.2166 ppl               |
-| neuralmaxime-7b-slerp.IQ4_NL.gguf | IQ4_NL |  4   |  4.16 GB  | 4.25 bpw non-linear quantization |
-| neuralmaxime-7b-slerp.Q4_K_M.gguf | Q4_K_M |  4   |  4.37 GB  | 3.80G, +0.0532 ppl               |
-| neuralmaxime-7b-slerp.Q5_K_M.gguf | Q5_K_M |  5   |  5.13 GB  | 4.45G, +0.0122 ppl               |
-| neuralmaxime-7b-slerp.Q6_K.gguf   | Q6_K   |  6   |  5.94 GB  | 5.15G, +0.0008 ppl               |
-| neuralmaxime-7b-slerp.Q8_0.gguf   | Q8_0   |  8   |  7.70 GB  | 6.70G, +0.0004 ppl               |
 ## Parameters

 ## Files Provided
+|                Name                |  Quant  | Bits | File Size |              Remark              |
+| ---------------------------------- | ------- | ---- | --------- | -------------------------------- |
+| neuralmaxime-7b-slerp.IQ3_XXS.gguf | IQ3_XXS |  3   |  3.02 GB  | 3.06 bpw quantization            |
+| neuralmaxime-7b-slerp.IQ3_S.gguf   | IQ3_S   |  3   |  3.18 GB  | 3.44 bpw quantization            |
+| neuralmaxime-7b-slerp.IQ3_M.gguf   | IQ3_M   |  3   |  3.28 GB  | 3.66 bpw quantization mix        |
+| neuralmaxime-7b-slerp.Q4_0.gguf    | Q4_0    |  4   |  4.11 GB  | 3.56G, +0.2166 ppl               |
+| neuralmaxime-7b-slerp.IQ4_NL.gguf  | IQ4_NL  |  4   |  4.16 GB  | 4.25 bpw non-linear quantization |
+| neuralmaxime-7b-slerp.Q4_K_M.gguf  | Q4_K_M  |  4   |  4.37 GB  | 3.80G, +0.0532 ppl               |
+| neuralmaxime-7b-slerp.Q5_K_M.gguf  | Q5_K_M  |  5   |  5.13 GB  | 4.45G, +0.0122 ppl               |
+| neuralmaxime-7b-slerp.Q6_K.gguf    | Q6_K    |  6   |  5.94 GB  | 5.15G, +0.0008 ppl               |
+| neuralmaxime-7b-slerp.Q8_0.gguf    | Q8_0    |  8   |  7.70 GB  | 6.70G, +0.0004 ppl               |
 ## Parameters