Update README.md
README.md
CHANGED
@@ -17,11 +17,11 @@ license: other
 license_name: llama3
 license_link: LICENSE
 ---
-# Updated beta quants based on new fixed tokenizer, only works with
+# Updated beta quants based on new fixed tokenizer, only works with in-development branch gg/bpe-preprocess
 
 # Quant Infos
 
-- Updated for latest bpe pre-tokenizer fixes
+- Updated for latest bpe pre-tokenizer fixes https://github.com/ggerganov/llama.cpp/pull/6920
 - quants done with an importance matrix for improved quantization loss
 - K & IQ quants in basically all variants from Q6_K down to IQ1_S
 - fixed end token for instruct mode (<|eot_id|>[128009])