update readme
Browse files
README.md
CHANGED
@@ -14,6 +14,8 @@ inference:
|
|
14 |
|
15 |
This is a DenseFormer implementation of Mistral-7B-v0.1. The details about DenseFormer are in the [paper](https://arxiv.org/abs/2402.02622).
|
16 |
|
|
|
|
|
17 |
This model WILL require additional pretraining to see the improvements indicated by the DenseFormer paper. Per the paper:
|
18 |
|
19 |
```
|
|
|
14 |
|
15 |
This is a DenseFormer implementation of Mistral-7B-v0.1. The details about DenseFormer are in the [paper](https://arxiv.org/abs/2402.02622).
|
16 |
|
17 |
+
You will need to use `trust_remote_code=True` to load this model.
|
18 |
+
|
19 |
This model WILL require additional pretraining to see the improvements indicated by the DenseFormer paper. Per the paper:
|
20 |
|
21 |
```
|