NOTE: You will need a recent build of llama.cpp to run these quants (i.e. at least commit 494c870
).
GGUF importance matrix (imatrix) quants for https://huggingface.co/fblgit/UNA-SimpleSmaug-34b-v1beta
- The importance matrix was trained for ~50K tokens (105 batches of 512 tokens) using a general purpose imatrix calibration dataset.
- The imatrix is being used on the K-quants as well.
Layers | Context | Template |
---|---|---|
60 |
32768 |
<|startoftext|>[INST] <<SYS>> |
- Downloads last month
- 12
Inference API (serverless) does not yet support gguf models for this pipeline type.
Model tree for dranger003/UNA-SimpleSmaug-34b-v1beta-iMat.GGUF
Base model
jondurbin/bagel-34b-v0.2
Finetuned
abacusai/Smaug-34B-v0.1
Finetuned
fblgit/UNA-SimpleSmaug-34b-v1beta