Edit model card
YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Model Card for t5_small Summarization Model

fine-tuned version of t5_small for summarization

Model Details

trained on CNN/Daily mail dataset

Training Data

CNN/Daily mail dataset

Training Procedure

  • Learning Rate: 2e-5
  • Epochs: 1
  • Batch Size: 4

How to Use

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("t5_small")
model = AutoModelForSeq2SeqLM.from_pretrained("t5_small")

input_text = "The movie was fantastic with a gripping storyline!"
inputs = tokenizer.encode(input_text, return_tensors="pt")
outputs = model(inputs)
print(outputs.logits)

Evaluation

eval_rouge1: 32.13 eval_rouge2: 11.85 eval_rougeL: 23.13 eval_bleu1: 29.29 eval_bleu2: 10.02 eval_bleu4: 3.83

Limitations

bad performance

Ethical Considerations

model has bias by cnn dataset

Downloads last month
0
Safetensors
Model size
60.5M params
Tensor type
F32
·
Inference API
Unable to determine this model's library. Check the docs .