mwitiderrick
commited on
Commit
•
08d7494
1
Parent(s):
fc9184a
Update README.md
Browse files
README.md
CHANGED
@@ -80,9 +80,10 @@ There are many factors to consider when choosing a university. Here are some tip
|
|
80 |
|
81 |
```bash
|
82 |
git clone https://github.com/neuralmagic/sparseml
|
83 |
-
pip install -e "sparseml[transformers]"
|
84 |
python sparseml/src/sparseml/transformers/sparsification/obcq/obcq.py TinyLlama/TinyLlama-1.1B-Chat-v0.4 open_platypus --recipe recipe.yaml --save True
|
85 |
python sparseml/src/sparseml/transformers/sparsification/obcq/export.py --task text-generation --model_path obcq_deployment
|
86 |
cp deployment/model.onnx deployment/model-orig.onnx
|
|
|
87 |
python onnx_kv_inject.py --input-file deployment/model-orig.onnx --output-file deployment/model.onnx
|
88 |
```
|
|
|
80 |
|
81 |
```bash
|
82 |
git clone https://github.com/neuralmagic/sparseml
|
83 |
+
pip install -e "sparseml[transformers]"
|
84 |
python sparseml/src/sparseml/transformers/sparsification/obcq/obcq.py TinyLlama/TinyLlama-1.1B-Chat-v0.4 open_platypus --recipe recipe.yaml --save True
|
85 |
python sparseml/src/sparseml/transformers/sparsification/obcq/export.py --task text-generation --model_path obcq_deployment
|
86 |
cp deployment/model.onnx deployment/model-orig.onnx
|
87 |
+
wget https://huggingface.co/nm-testing/TinyLlama-1.1B-Chat-v0.4-pruned50-quant/raw/main/recipe.yaml # download recipe
|
88 |
python onnx_kv_inject.py --input-file deployment/model-orig.onnx --output-file deployment/model.onnx
|
89 |
```
|