huggingface-course
/

mt5-small-finetuned-amazon-en-es

@@ -2,7 +2,6 @@
 license: apache-2.0
 tags:
 - generated_from_trainer
-- pipeline:summarization
 datasets:
 - null
 metrics:
@@ -16,7 +15,7 @@ model-index:
     metrics:
     - name: Rouge1
       type: rouge
-      value: 8.8272
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -26,12 +25,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.3342
-- Rouge1: 8.8272
-- Rouge2: 2.5114
-- Rougel: 8.6749
-- Rougelsum: 8.6722
-- Gen Len: 4.2877
 ## Model description
@@ -56,15 +55,18 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
-|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| 9.4562        | 1.0   | 2202 | 3.5591          | 6.6009 | 1.7239 | 6.5036 | 6.5228    | 3.4434  |
-| 4.6481        | 2.0   | 4404 | 3.3600          | 7.3535 | 1.9174 | 7.2846 | 7.3053    | 3.809   |
-| 4.3333        | 3.0   | 6606 | 3.3342          | 8.8272 | 2.5114 | 8.6749 | 8.6722    | 4.2877  |
 ### Framework versions

 license: apache-2.0
 tags:
 - generated_from_trainer
 datasets:
 - null
 metrics:
     metrics:
     - name: Rouge1
       type: rouge
+      value: 10.8752
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.1491
+- Rouge1: 10.8752
+- Rouge2: 3.8695
+- Rougel: 10.6991
+- Rougelsum: 10.6616
+- Gen Len: 5.6085
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 6
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Rouge1  | Rouge2 | Rougel  | Rougelsum | Gen Len |
+|:-------------:|:-----:|:-----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
+| 9.1733        | 1.0   | 2202  | 3.4863          | 6.3629  | 1.4637 | 6.2501  | 6.2752    | 3.3302  |
+| 4.4547        | 2.0   | 4404  | 3.2809          | 9.1283  | 2.992  | 8.9851  | 9.0487    | 4.7642  |
+| 4.0581        | 3.0   | 6606  | 3.2108          | 10.5207 | 3.7411 | 10.2595 | 10.234    | 5.3208  |
+| 3.8821        | 4.0   | 8808  | 3.1701          | 10.8636 | 4.0944 | 10.6462 | 10.6468   | 5.2453  |
+| 3.7857        | 5.0   | 11010 | 3.1600          | 10.9456 | 4.5187 | 10.784  | 10.7542   | 5.691   |
+| 3.7273        | 6.0   | 13212 | 3.1491          | 10.8752 | 3.8695 | 10.6991 | 10.6616   | 5.6085  |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4a89d30bbdfb33b505516cfb5f81c0eca7d5521acbf7ecc4275d8cd4dc0d1229
 size 1200770885

 version https://git-lfs.github.com/spec/v1
+oid sha256:9fc1cebaa89d1bddeb41b5c3a2014bfc2d70cde4af9bb38ac8c5e578f61bcf22
 size 1200770885

tokenizer.json CHANGED Viewed

The diff for this file is too large to render. See raw diff

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:528e1467127904051d0c15f6a2903cb85414605e00a9e072e543cc8884e47188
 size 2735

 version https://git-lfs.github.com/spec/v1
+oid sha256:b6c696d829fb994c3c16fe17700311c5b844cb2570a26fe27827c9598487fe10
 size 2735