AWQ/GPTQ Plans?
#1, opened by ZQ-Dev
First off, amazing work! Thank you for creating this finetune and releasing it to the wild. Hats off to Nous.
Are there any plans to create and release the model in quant formats other than GGUF/FP8? AWQ and GPTQ in particular.
The 70B and 8B have GGUFs, the 405B only has FP8, and that's all we can do for now.
teknium changed discussion status to closed