Necessary hardware for Operating the 34B Model

#40

by blurjp - opened Nov 29, 2023

Nov 29, 2023

I currently use a 4090, but the inference process is extremely slow. Is it impractical to expect this model to run efficiently on just a single 4090?

ShampX

Dec 10, 2023

Did you solve that? I have a same problem.

Jan 3

You can use these 2 bit versions made with quip#. Inference is slower than usual but it should work on a single 4090.

YShow

Apr 30

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment