pmysl
/

c4ai-command-r-plus-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

pmysl commited on Apr 5

Commit

4caa150

•

1 Parent(s): 25f23b8

Update README.md

Files changed (1) hide show

README.md +7 -0

README.md CHANGED Viewed

@@ -4,4 +4,11 @@ license: cc-by-nc-4.0
 # Command R+ GGUF
 This repository contains experimental GGUF weights that are currently compatible only with the following fork: [https://github.com/Noeda/llama.cpp/tree/53f71f0026cbed4588b2ad16c51db630d2745794](https://github.com/Noeda/llama.cpp/tree/53f71f0026cbed4588b2ad16c51db630d2745794). I will update them once support for Command R+ is merged into the llama.cpp repository

 # Command R+ GGUF
+## Description
 This repository contains experimental GGUF weights that are currently compatible only with the following fork: [https://github.com/Noeda/llama.cpp/tree/53f71f0026cbed4588b2ad16c51db630d2745794](https://github.com/Noeda/llama.cpp/tree/53f71f0026cbed4588b2ad16c51db630d2745794). I will update them once support for Command R+ is merged into the llama.cpp repository
+## Concatenating Weights
+For every variant (except Q2_K), you must concatenate the weights, as they exceed the 50 GB single file size limit on HuggingFace. You can accomplish this using the `cat` command on Linux (example for the Q3 variant):
+```bash
+cat command-r-plus-Q3_K_L-0000* > command-r-plus-Q3_K_L.gguf
+```