Fix typo
README.md
CHANGED
@@ -24,7 +24,7 @@ All quants made using imatrix option with dataset from [here](https://gist.githu
 
 ## What's new
 
-- June
+- June 31 2024: Contains latest tokenizer fixes, which addressed a few oddities from the original fix, should be closest to correct performance yet. Also has metadata for SWA and logit softcapping.
 - July 3 2024: Updated the experimental quants to newer method, Q8 for embed/output, yields higher quality at much lower size than f16 (left Q8_0_L since Q8_0 is already Q8 embed/output)
 
 ## Prompt format