Intel
/

distilbert-base-uncased-MRPC-int8-static-inc

Text Classification

text-classfication

neural-compressor

PostTrainingsStatic

Intel® Neural Compressor

Inference Endpoints

Model card Files Files and versions Community

distilbert-base-uncased-MRPC-int8-static-inc / README.md

violetch24's picture

Update README.md

eac9e58 over 1 year ago

|

1.54 kB

metadata

language: en
license: mit
datasets:
  - glue
  - mrpc
metrics:
  - f1
tags:
  - text-classfication
  - nlp
  - neural-compressor
  - PostTrainingsStatic
  - int8
  - Intel® Neural Compressor

Dynamically quantized DistilBERT base uncased finetuned MPRC

Table of Contents

Model Details
How to Get Started With the Model

Model Details

Model Description: This model is a DistilBERT fine-tuned on MPRC statically quantized with optimum-intel through the usage of huggingface/optimum-intel through the usage of Intel® Neural Compressor.

Model Type: Text Classification
Language(s): English
License: Apache-2.0
Parent Model: For more details on the original model, we encourage users to check out this model card.

How to Get Started With the Model

PyTorch

To load the quantized model, you can do as follows:

from optimum.intel.neural_compressor.quantization import IncQuantizedModelForSequenceClassification

model = IncQuantizedModelForSequenceClassification.from_pretrained("Intel/distilbert-base-uncased-MRPC-int8-static")

Test result

	INT8	FP32
Accuracy (eval-f1)	0.9007	0.9027
Model size (MB)	242	268