---
pipeline_tag: token-classification
tags:
- code
license: apache-2.0
datasets:
- Alex123321/english_cefr_dataset
language:
- en
metrics:
- accuracy
library_name: transformers
---
# Model Card: BERT-based CEFR Classifier

## Overview

This repository contains a model trained to predict Common European Framework of Reference (CEFR) levels for a given text using a BERT-based model architecture. The model was fine-tuned on the CEFR dataset, and the `bert-base-...` pre-trained model was used as the base.

## Model Details

- Model architecture: BERT (base model: `bert-base-...`)
- Task: CEFR level prediction for text classification
- Training dataset: CEFR dataset
- Fine-tuning: Epochs, Loss, Accuracy, etc.

## Performance

The model's performance during training is summarized below:


| Epoch | Training Loss | Validation Loss 
|-------|---------------|-----------------
| 1     | 0.350600      | 0.400248        
| 2     | 0.300800      | 0.449286        
| 3     | 0.218800      | 0.510898        
| 4     | 0.150300      | 0.599973        
| 5     | 0.099000      | 0.678500 

Additional metrics:

- Training Loss: 0.22364403076171874
- Training Runtime: 5274.6105 seconds
- Training Samples per Second: 1.303
- Total Floating Point Operations: 1.4498992298262528e+16

## Usage

1. Install the required libraries by running `pip install transformers`.
2. Load the trained model and use it for CEFR level prediction.


from transformers import pipeline

# Load the model
model_name = "AbdulSami/bert-base-cased-cefr"

classifier = pipeline("text-classification", model=model_name)

# Text for prediction
text = "This is a sample text for CEFR classification."

# Predict CEFR level
predictions = classifier(text)

# Print the predictions
print(predictions)