Tried with MPS?

by kronosprime - opened Nov 8, 2023

Nov 8, 2023

I adjusted the code to replace CUDA references with MPS, but after 20 minutes on the fastest M2 with 96GB the generations hadn't fully finished. So I wanted to ask if anyone else had the same result, or did it work for you?

migtissera

Owner Nov 8, 2023

You might want to run the GGUF version then, no? Not sure whether @TheBloke has quantized this.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment