bit-dny commited on
Commit
5c6b1ad
1 Parent(s): faea48b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -0
README.md CHANGED
@@ -44,6 +44,14 @@ To cite this model, please use
44
 
45
  <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
46
 
 
 
 
 
 
 
 
 
47
  ### Direct Use
48
 
49
  <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
 
44
 
45
  <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
46
 
47
+ ### Deployment resource consumption
48
+ | Precision | Minimum GPU memory (Inference) | Minimum GPU memory (Full Parameter Fine-tuning) |
49
+ |-------|-------|-------|
50
+ | float32 | 6.08G | 32.65G |
51
+ | float16(unquantized) | 3.45G | -(36.94G*) |
52
+ | bfloat16(unquantized) | 3.45G | 20.47G(33.93G*) |
53
+ * \* Indicates use of mixed precision
54
+
55
  ### Direct Use
56
 
57
  <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->