Multi-part model
How do I use a multi-part model?
DeepSeek-V2.5-IQ1_M-00001-of-00002.gguf
DeepSeek-V2.5-IQ1_M-00002-of-00002.gguf
I could not get it to run on Colab.
You just have to load the first part; any llama.cpp tool will then look for and load the second one automatically:
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="bartowski/DeepSeek-V2.5-GGUF",
    filename="DeepSeek-V2.5-IQ1_M/DeepSeek-V2.5-IQ1_M-00001-of-00002.gguf",
)

llm.create_chat_completion(
    messages=[
        {"role": "user", "content": "What is the capital of France?"}
    ]
)

Should I just run this code? Will it download the two parts by itself and run them without me merging them first?
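If it helps to make the two-part download explicit, here is a minimal sketch (assuming only the standard huggingface_hub API; the paths mirror the repo above) that fetches both shards first and then loads the first one, letting llama.cpp pick up the second from the matching filename:

from huggingface_hub import hf_hub_download
from llama_cpp import Llama

repo_id = "bartowski/DeepSeek-V2.5-GGUF"
parts = [
    "DeepSeek-V2.5-IQ1_M/DeepSeek-V2.5-IQ1_M-00001-of-00002.gguf",
    "DeepSeek-V2.5-IQ1_M/DeepSeek-V2.5-IQ1_M-00002-of-00002.gguf",
]

# hf_hub_download returns the local path of each file; both shards
# end up side by side in the same cache directory.
paths = [hf_hub_download(repo_id=repo_id, filename=p) for p in parts]

# Load the first shard only; llama.cpp discovers -00002-of-00002 next to it.
llm = Llama(model_path=paths[0])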
!./llama-gguf-split --merge DeepSeek-V2.5-IQ1_M-00001-of-00002.gguf DeepSeek-V2.5-IQ1_M-00002-of-00002.gguf DeepSeek-V2.5-IQ1_M.gguf
Every time I merge, the second part is deleted.
I think you're meant to pass only the first part, like this:

./llama-gguf-split --merge DeepSeek-V2.5-IQ1_M-00001-of-00002.gguf DeepSeek-V2.5-IQ1_M.gguf

The --merge mode takes an input shard and an output path, and it finds the remaining -0000N-of-00002 shards from the filename pattern. In your command the second shard sat in the output position, which is why it kept getting overwritten.
I would like to add that the files downloaded with

huggingface-cli download bartowski/DeepSeek-V2.5-GGUF --include "DeepSeek-V2.5-Q8_0/*" --local-dir ./

are symlinks into the Hugging Face cache, and the merge command didn't work with symlinks. I copied the actual files into a directory, and the merge then worked fine.
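If you'd rather skip the manual copy, a minimal sketch (assuming the standard huggingface_hub Python API; recent versions write real files for local_dir downloads by default, and older versions accept local_dir_use_symlinks=False) is:

from huggingface_hub import snapshot_download

# Download the Q8_0 shards as real files into the current directory,
# so llama-gguf-split can merge them without hitting cache symlinks.
snapshot_download(
    repo_id="bartowski/DeepSeek-V2.5-GGUF",
    allow_patterns=["DeepSeek-V2.5-Q8_0/*"],
    local_dir="./",
)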