Model Card for t5_small Summarization Model

Model Details

This model is Sentiment Analysis model based on T5 model designed by Google Research.

Used the CNN/DailyMail dataset for train, validation. Train : 2871 Validation : 134

batch_size = 4, lr = 2e-5, epochs = 1, weight_decay = 0.01

simply use the tranformers library to load the model and tokenizer.

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained(model_name) model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

input_text = "This is a test sentence" input = tokenizer.encode(input_text, return_tensors="pt") outputs = model(input) print(outputs.logits)

accuracy : 0.84 eval_loss : 0.21156078577041626 BLEU-1 : 38.46

Due to small dataset and small epoch, the model may not be able to generalize well to other datasets.

The model is trained on the CNN/DailyMail dataset which is a public dataset.