LoneStriker/Nous-Hermes-2-Mixtral-8x7B-DPO-4.0bpw-h6-exl2

Hugging Face

Huggingface Run

by GokhanAI - opened Feb 1

Discussion

GokhanAI

Feb 1

How to run this model ? We can run via AutoModelForCausalLM etc. Can you share us ? I am new this format. I am sorry.

LoneStriker

Owner Feb 2

These are exllamav2 quantized versions of the models. You need to use exllamav2 itself to load these models via Python. There are simple examples in the Github project to load the model and run inference. If you want to use a GUI, use exui, ooba's text-generation-webui (with exllamav2 or exllamav2_hf as model loaders), or other packages like tabbyAPI.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment