Update README.md
README.md
CHANGED
@@ -44,6 +44,14 @@ To cite this model, please use
 
 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 
+### Deployment resource consumption
+| Precision | Minimum GPU memory (Inference) | Minimum GPU memory (Full Parameter Fine-tuning) |
+|-------|-------|-------|
+| float32 | 6.08G | 32.65G |
+| float16(unquantized) | 3.45G | -(36.94G*) |
+| bfloat16(unquantized) | 3.45G | 20.47G(33.93G*) |
+* \* Indicates use of mixed precision
+
 ### Direct Use
 
 <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
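
The table added in this change can be read as a lookup: given a GPU memory budget, which precisions are listed as fitting? The sketch below is illustrative only and is not part of the README. The dictionary copies the figures from the table; the names `REQUIREMENTS` and `precisions_that_fit` are hypothetical, and the unit "G" is taken as written (the table does not say whether it means GB or GiB). The `-` cell and the missing mixed-precision figure for float32 are both represented as `None`.

```python
# Minimum GPU memory in "G", copied from the table in this diff.
# None marks a cell with no figure: "-" for pure float16 fine-tuning,
# and no starred (mixed precision) value listed for float32.
REQUIREMENTS = {
    "float32": {"inference": 6.08, "finetune": 32.65, "finetune_mixed": None},
    "float16": {"inference": 3.45, "finetune": None, "finetune_mixed": 36.94},
    "bfloat16": {"inference": 3.45, "finetune": 20.47, "finetune_mixed": 33.93},
}


def precisions_that_fit(budget_g, workload):
    """Return the precisions whose listed minimum for `workload` is <= budget_g.

    `workload` is one of "inference", "finetune", or "finetune_mixed".
    Precisions with no listed figure for that workload are skipped.
    """
    return [
        precision
        for precision, row in REQUIREMENTS.items()
        if row[workload] is not None and row[workload] <= budget_g
    ]


if __name__ == "__main__":
    # Hypothetical budgets, chosen only to exercise the lookup.
    print(precisions_that_fit(4, "inference"))
    print(precisions_that_fit(24, "finetune"))
```

These are listed minimums, so actual usage would presumably grow with batch size and sequence length; the lookup only reflects what the table states.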