GTPQ, AWQ, GGUF version request

by Komposter43 - opened Oct 10, 2023

Discussion

Komposter43

Oct 10, 2023

@TheBloke
Would you like to make the quantized version of this model for the community. Thank you.

gotzmann

Oct 10, 2023

@TheBloke please-please

delphijb

Oct 11, 2023

@TheBloke
Hello, this is the futur goat !
please, please, change the priority order, we need a GGUF + AWQ & GPTQ :-)

TheBloke

Oct 11, 2023

I'm having a hard time finding the hw to do multiple quants for 70b models. I ll try to get it done later today

ehartford

Oct 12, 2023

Shouldn't it have a model card first?

Komposter43

Oct 12, 2023

Shouldn't it have a model card first?

It is top1 on leaderboard now: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard

https://huggingface.co/datasets/open-llm-leaderboard/details_ICBU-NPU__FashionGPT-70B-V1.2

ehartford

Oct 12, 2023

That doesn't mean it shouldn't have a model card

gotzmann

Oct 12, 2023

•

edited Oct 12, 2023

I'm having a hard time finding the hw to do multiple quants for 70b models. I ll try to get it done later today

So maybe start with some most popular quants? As for me I'm mostly need ONLY GGUF 4KM for 70B as it's the right size to fit into one 48Gb card or two 24Gb

Komposter43

Oct 12, 2023

That doesn't mean it shouldn't have a model card

How are quantization proccess related to the model card?

You can see card from version 1.1: https://huggingface.co/ICBU-NPU/FashionGPT-70B-V1.1

There are small difference.

TheBloke

Oct 12, 2023

Quants for this model are starting now

gotzmann

Oct 12, 2023

Hmm, looks like prompt format from v1.1 do not work with v1.2 properly :(

Yhyu13

Oct 13, 2023

@TheBloke

Just fond another 70b model steals the #1 on the openllm leaderboard just now https://huggingface.co/ValiantLabs/ShiningValiant/tree/main. Shame it does not have a discussion section, so I have to place the GPTQ request here.

ehartford

Oct 13, 2023

at least it has a model card

@TheBloke

Just fond another 70b model steals the #1 on the openllm leaderboard just now https://huggingface.co/ValiantLabs/ShiningValiant/tree/main. Shame it does not have a discussion section, so I have to place the GPTQ request here.

Komposter43 changed discussion status to closed Oct 16, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment