TheBloke
/

stable-vicuna-13B-GPTQ

@@ -24,6 +24,20 @@ This model works best with the following prompt template:
 ### Assistant:
 ```
 ## GIBBERISH OUTPUT IN `text-generation-webui`?
 Please read the Provided Files section below. You should use `stable-vicuna-13B-GPTQ-4bit.compat.no-act-order.safetensors` unless you are able to use the latest GPTQ-for-LLaMa code.
@@ -59,20 +73,6 @@ Unless you are able to use the latest GPTQ-for-LLaMa code, please use `stable-vi
     CUDA_VISIBLE_DEVICES=0 python3 llama.py stable-vicuna-13B-HF c4 --wbits 4 --true-sequential --act-order --groupsize 128 --save_safetensors stable-vicuna-13B-GPTQ-4bit.act-order.safetensors
     ```
-## How to easily download and use a model in text-generation-webui
-Load text-generation-webui as you normally do.
-1. Click the **Model tab**.
-2. Under **Download custom model or LoRA**, enter the repo name to download: `TheBloke/stable-vicuna-13B-GPTQ`.
-3. Click **Download**.
-4. Wait until it says it's finished downloading.
-5. As this is a GPTQ model, fill in the `GPTQ parameters` on the right: `Bits = 4`, `Groupsize = 128`, `model_type = Llama`
-6. Now click the **Refresh** icon next to **Model** in the top left.
-7. In the **Model drop-down**: choose the model you just downloaded, eg `stable-vicuna-13B-GPTQ`.
-8. Click **Reload the Model** in the top right.
-9. Once it says it's loaded, click the **Text Generation tab** and enter a prompt!
 ## Manual instructions for `text-generation-webui`
 File `stable-vicuna-13B-GPTQ-4bit.compat.no-act-order.safetensors` can be loaded the same as any other GPTQ file, without requiring any updates to [oobaboogas text-generation-webui](https://github.com/oobabooga/text-generation-webui).

 ### Assistant:
 ```
+## How to easily download and use this model in text-generation-webui
+Load text-generation-webui as you normally do.
+1. Click the **Model tab**.
+2. Under **Download custom model or LoRA**, enter the repo name to download: `TheBloke/stable-vicuna-13B-GPTQ`.
+3. Click **Download**.
+4. Wait until it says it's finished downloading.
+5. As this is a GPTQ model, fill in the `GPTQ parameters` on the right: `Bits = 4`, `Groupsize = 128`, `model_type = Llama`
+6. Now click the **Refresh** icon next to **Model** in the top left.
+7. In the **Model drop-down**: choose the model you just downloaded, eg `stable-vicuna-13B-GPTQ`.
+8. Click **Reload the Model** in the top right.
+9. Once it says it's loaded, click the **Text Generation tab** and enter a prompt!
 ## GIBBERISH OUTPUT IN `text-generation-webui`?
 Please read the Provided Files section below. You should use `stable-vicuna-13B-GPTQ-4bit.compat.no-act-order.safetensors` unless you are able to use the latest GPTQ-for-LLaMa code.
     CUDA_VISIBLE_DEVICES=0 python3 llama.py stable-vicuna-13B-HF c4 --wbits 4 --true-sequential --act-order --groupsize 128 --save_safetensors stable-vicuna-13B-GPTQ-4bit.act-order.safetensors
     ```
 ## Manual instructions for `text-generation-webui`
 File `stable-vicuna-13B-GPTQ-4bit.compat.no-act-order.safetensors` can be loaded the same as any other GPTQ file, without requiring any updates to [oobaboogas text-generation-webui](https://github.com/oobabooga/text-generation-webui).