|
|
|
Qwen2.5-1.5B-Instruct |
|
Quant Size (MB) PPL Size (%) Accuracy (%) PPL error rate |
|
IQ1_S 417 193.6245 1.77149 |
|
IQ1_M 443 66.9068 0.52878 |
|
IQ2_XXS 488 33.3356 0.25559 |
|
IQ2_XS 525 20.2870 0.14936 |
|
IQ2_S 538 18.2927 0.13380 |
|
IQ2_M 574 15.4838 0.11113 |
|
Q2_K_S 611 16.0169 0.11623 |
|
IQ3_XXS 638 12.3935 0.08770 |
|
Q2_K 645 14.1657 0.10105 |
|
IQ3_XS 698 11.7112 0.08256 |
|
Q3_K_S 726 12.4782 0.08842 |
|
IQ3_S 728 11.4241 0.07977 |
|
IQ3_M 741 11.4058 0.07862 |
|
Q3_K_M 786 11.3529 0.08018 |
|
Q3_K_L 840 11.1934 0.07913 |
|
IQ4_XS 855 10.5302 0.07351 |
|
IQ4_NL 893 10.5116 0.07335 |
|
Q4_0 895 10.8217 0.07576 |
|
Q4_K_S 897 10.5236 0.07360 |
|
Q4_K_M 941 10.4628 0.07310 |
|
Q4_1 970 10.5100 0.07347 |
|
Q5_K_S 1048 10.2715 0.07148 |
|
Q5_0 1051 10.3196 0.07212 |
|
Q5_K_M 1073 10.2529 0.07143 |
|
Q5_1 1126 10.2624 0.07140 |
|
Q6_K 1214 10.2030 0.07108 |
|
Q8_0 1571 10.1670 0.07068 |
|
F16 2951 10.1512 0.07058 |
|
|