metadata
language: en
license: mit
datasets:
- glue
- mrpc
metrics:
- f1
tags:
- text-classfication
- nlp
- neural-compressor
- PostTrainingsStatic
- int8
- Intel® Neural Compressor
Dynamically quantized DistilBERT base uncased finetuned MPRC
Table of Contents
Model Details
Model Description: This model is a DistilBERT fine-tuned on MPRC statically quantized with optimum-intel through the usage of huggingface/optimum-intel through the usage of Intel® Neural Compressor.
- Model Type: Text Classification
- Language(s): English
- License: Apache-2.0
- Parent Model: For more details on the original model, we encourage users to check out this model card.
How to Get Started With the Model
PyTorch
To load the quantized model, you can do as follows:
from optimum.intel.neural_compressor.quantization import IncQuantizedModelForSequenceClassification
model = IncQuantizedModelForSequenceClassification.from_pretrained("Intel/distilbert-base-uncased-MRPC-int8-static")
Test result
INT8 | FP32 | |
---|---|---|
Accuracy (eval-f1) | 0.9007 | 0.9027 |
Model size (MB) | 242 | 268 |