Failed to quantize
I keep trying to quantize all of your models, and every single minitron-4B derivative of the Llama 3.1 base model fails with this error:
FourOhFour-Maelstrom_4B
Using CUDA. Available GPU memory: 23.60 GB
Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s]
Quantization failed: Trying to set a tensor of shape torch.Size([1024, 3072]) in "weight" (which has shape torch.Size([768, 3072])), this looks incorrect.
An error occurred during the quantization process: Trying to set a tensor of shape torch.Size([1024, 3072]) in "weight" (which has shape torch.Size([768, 3072])), this looks incorrect.
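For what it's worth, the 1024-vs-768 mismatch lines up with a loader that derives head_dim from hidden_size / num_attention_heads instead of reading the explicit head_dim these Minitron-width configs carry. A minimal sketch of that arithmetic (the repo id and config values are my assumptions, not verified here):

```python
from transformers import AutoConfig

# Assumed repo id for the model above; the config fields are standard Llama ones.
cfg = AutoConfig.from_pretrained("FourOhFour/Maelstrom_4B")

# Assumption: the Minitron-width checkpoints carry an explicit head_dim (128)
# that is larger than hidden_size / num_attention_heads (3072 / 32 = 96).
derived = cfg.hidden_size // cfg.num_attention_heads
explicit = getattr(cfg, "head_dim", None) or derived

# k/v projection rows the checkpoint actually contains: 8 * 128 = 1024
print("checkpoint k_proj shape:", (cfg.num_key_value_heads * explicit, cfg.hidden_size))
# rows an older loader allocates from the derived head_dim: 8 * 96 = 768
print("old-loader k_proj shape:", (cfg.num_key_value_heads * derived, cfg.hidden_size))
```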
Have you tried updating transformers to 4.45.0?
pip install -U transformers@git+https://github.com/huggingface/transformers.git
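That git install should report a 4.45 dev version or later. A quick way to check (the head_dim probe is my guess at the relevant change, not a documented flag):

```python
import transformers
from transformers import LlamaConfig

print(transformers.__version__)  # a git-main install should show 4.45.0.dev0 or later

# My guess at the relevant change: newer releases define head_dim directly on
# LlamaConfig, so a default config exposes the key; older ones do not.
print("head_dim" in LlamaConfig().to_dict())
```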
Well, this is the crux of my problem. Upgrading transformers (it requires a forced upgrade that breaks other dependencies) also forces me onto a different torch (CUDA) version, and even after I recompile the AutoAWQ_kernel, the model I produce responds with nothing but "!!!!".
So far, the newest combination that gives me a stable quantization pipeline is torch 2.4.0 with transformers 4.44.2.
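For reference, this is the sort of pin check I run before a batch job; the version pins are just my known-good combination, not an official requirement:

```python
from importlib.metadata import version

# Known-good pins for my AWQ pipeline (my combination, not official requirements)
pins = {"torch": "2.4.0", "transformers": "4.44.2"}

for pkg, want in pins.items():
    have = version(pkg)
    ok = have.startswith(want)  # torch reports e.g. "2.4.0+cu121"
    status = "OK" if ok else f"MISMATCH (expected {want})"
    print(f"{pkg}: {have} {status}")
```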
When I override the dependency checks and force transformers 4.45.x, as mentioned above, the response is just a wall of "!!!!".
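The garbage output is at least easy to catch automatically; this is roughly the smoke test I run after each quant (the output path is hypothetical):

```python
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

quant_path = "./maelstrom-4b-awq"  # hypothetical local output directory

model = AutoAWQForCausalLM.from_quantized(quant_path, fuse_layers=True)
tokenizer = AutoTokenizer.from_pretrained(quant_path)

inputs = tokenizer("The capital of France is", return_tensors="pt").to("cuda")
out = model.generate(**inputs, max_new_tokens=16)
text = tokenizer.decode(out[0], skip_special_tokens=True)
print(text)

# A bad kernel/transformers combination emits runs of "!" instead of text.
assert "!!!!" not in text, "quantized model is emitting garbage"
```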
Not sure how to proceed. I don't think this is actually a problem with your model, so I'm sorry for venting my frustrations here.
My best suggestion is to try another quant format. Both Exllama and GGUF have been confirmed to work by members of the Anthracite organization.
I understand. I run the SolidRusT (SRT) organization, and we just do AWQ: https://huggingface.co/solidrust
I'll try again another time. Thank you for your responses.