violetch24 commited on
Commit
b94d8a3
1 Parent(s): 631ad76

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -0
README.md CHANGED
@@ -1,3 +1,41 @@
1
  ---
 
2
  license: mit
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language: en
3
  license: mit
4
+ tags:
5
+ - text-classfication
6
+ - int8
7
+ - Intel® Neural Compressor
8
+ - PostTrainingStatic
9
+ - bert
10
+ datasets:
11
+ - mrpc
12
+ - stsb
13
+ metrics:
14
+ - f1
15
  ---
16
+
17
+ # INT8 BERT base uncased finetuned STS-B
18
+
19
+ ## Post-training static quantization
20
+
21
+ ### PyTorch
22
+
23
+ This is an INT8 PyTorch model quantized with [huggingface/optimum-intel](https://github.com/huggingface/optimum-intel) through the usage of [Intel® Neural Compressor](https://github.com/intel/neural-compressor).
24
+
25
+ The original fp32 model comes from the fine-tuned model [textattack/bert-base-uncased-STS-B](https://huggingface.co/textattack/bert-base-uncased-STS-B).
26
+
27
+ #### Test result
28
+
29
+ | |INT8|FP32|
30
+ |---|:---:|:---:|
31
+ | **Accuracy (eval-f1)** |0.8755|0.8805|
32
+ | **Model size (MB)** |118|438|
33
+
34
+ #### Load with optimum:
35
+
36
+ ```python
37
+ from optimum.intel.neural_compressor.quantization import IncQuantizedModelForSequenceClassification
38
+ int8_model = IncQuantizedModelForSequenceClassification.from_pretrained(
39
+ 'Intel/bert-base-uncased-STS-B-int8',
40
+ )
41
+ ```