maddes8cht
commited on
Commit
•
8a29453
1
Parent(s):
a00a759
"Update README.md"
Browse files
README.md
CHANGED
@@ -12,6 +12,16 @@ I'm constantly enhancing these model descriptions to provide you with the most r
|
|
12 |
Persimmon is a Large language Model from Adept AI. It is trained from Scratch with a context legth of 16k, which is 4 times the context size of LLaMA2 or ChatGPT and 8 times that of GPT-3
|
13 |
|
14 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
15 |
|
16 |
# About GGUF format
|
17 |
|
|
|
12 |
Persimmon is a Large language Model from Adept AI. It is trained from Scratch with a context legth of 16k, which is 4 times the context size of LLaMA2 or ChatGPT and 8 times that of GPT-3
|
13 |
|
14 |
|
15 |
+
---
|
16 |
+
# Brief
|
17 |
+
This is a preview of adepts persimmon base model.
|
18 |
+
It i snot based on the model published at https://huggingface.co/adept/persimmon-8b-base but on the ones released on the tar files in https://github.com/persimmon-ai-labs/adept-inference.
|
19 |
+
As these seems to be slightly different, models based on the huggingface release will follow as soon as possible.
|
20 |
+
## Note: These models do not seem to work with cuda acceleration at the moment. If you are using the Cublas version of Llama.cpp, you need to set `--n-gpu-layers 0` for it to work. (At a later date this may work again with Cuda, so feel free to play with this setting)
|
21 |
+
|
22 |
+
---
|
23 |
+
|
24 |
+
|
25 |
|
26 |
# About GGUF format
|
27 |
|