Please add to llama.cpp and ollama
#21 by KeilahElla - opened
As the title says: it would be great to use this with ollama/llama.cpp, which are usually much faster than transformers.
Not sure that claim is super accurate :) `torch.compile` can get you pretty far with transformers.
It would be more convenient to use in Ollama. Requesting Ollama support.
@ArthurZ you are right, but what about when we run it on CPU? Maybe llama.cpp works very well there. What do you think?