YugoGPT-Quantized-GGUF / YugoGPT-Quantized.GGUF.Q5_K_M.gguf

Commit History

q5_k_m: Recommended. Uses Q6_K for half of the attention.wv and feed_forward.w2 tensors, else Q5_K
5e97c4c
verified

datatab committed on
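
The commit message above describes a mixed-precision scheme: roughly half of the attention.wv and feed_forward.w2 tensors are stored as Q6_K, and everything else as Q5_K. The sketch below illustrates one way such a per-layer choice could be made. The function names (`use_more_bits`, `choose_type`) and the exact layer-selection rule are assumptions for illustration, modeled on the "first eighth, last eighth, and every third layer in between" heuristic; the source does not state which layers are selected, and the tensor names are simply the ones used in the commit message.

```python
# Hypothetical sketch of a q5_k_m-style tensor type selection.
# The selection rule below is an assumption, not taken from this repository.

def use_more_bits(i_layer: int, n_layers: int) -> bool:
    """Return True for roughly half of the layers (assumed heuristic)."""
    eighth = n_layers // 8
    return (
        i_layer < eighth                      # first eighth of the layers
        or i_layer >= 7 * n_layers // 8       # last eighth of the layers
        or (i_layer - eighth) % 3 == 2        # every third layer in between
    )

def choose_type(tensor_name: str, i_layer: int, n_layers: int) -> str:
    """Pick Q6_K for selected attention.wv / feed_forward.w2 tensors, else Q5_K."""
    sensitive = ("attention.wv" in tensor_name) or ("feed_forward.w2" in tensor_name)
    if sensitive and use_more_bits(i_layer, n_layers):
        return "Q6_K"
    return "Q5_K"

def mixed_types(tensor_suffix: str, n_layers: int) -> list[str]:
    """List the chosen type of one tensor kind across all layers."""
    return [
        choose_type(f"layers.{i}.{tensor_suffix}", i, n_layers)
        for i in range(n_layers)
    ]

if __name__ == "__main__":
    types = mixed_types("attention.wv", 32)
    print(types.count("Q6_K"), "of", len(types), "attention.wv tensors use Q6_K")
```

Under this assumed rule, a 32-layer model ends up with 16 of its 32 attention.wv tensors in Q6_K, which matches the "half" in the commit message; other layer counts give a fraction near, but not always exactly, one half.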