Lewdiculous
commited on
Commit
•
7acbf3a
1
Parent(s):
c3cb0d9
Update README.md
Browse files
README.md
CHANGED
@@ -35,7 +35,7 @@ The **Imatrix** is calculated based on calibration data, and it helps determine
|
|
35 |
The idea is to preserve the most important information during quantization, which can help reduce the loss of model performance, especially when the calibration data is diverse.
|
36 |
[[1]](https://github.com/ggerganov/llama.cpp/discussions/5006) [[2]](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384)
|
37 |
|
38 |
-
For imatrix data generation, kalomaze's `groups_merged.txt` with
|
39 |
|
40 |
</details><br>
|
41 |
|
|
|
35 |
The idea is to preserve the most important information during quantization, which can help reduce the loss of model performance, especially when the calibration data is diverse.
|
36 |
[[1]](https://github.com/ggerganov/llama.cpp/discussions/5006) [[2]](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384)
|
37 |
|
38 |
+
For imatrix data generation, kalomaze's `groups_merged.txt` with additional roleplay chats was used, you can find it [here](https://huggingface.co/Lewdiculous/Nyanade_Stunna-Maid-7B-GGUF-IQ-Imatrix/blob/main/imatrix-with-rp-ex.txt). This was just to add a bit more diversity to the data with the intended use case in mind.
|
39 |
|
40 |
</details><br>
|
41 |
|