Gabriel
/

bart-base-cnn-xsum-swe

text2text-generation

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Gabriel commited on Sep 28, 2022

Commit

2783f9e

•

1 Parent(s): 4a77a48

update model card README.md

Files changed (1) hide show

README.md +13 -12

README.md CHANGED Viewed

@@ -16,12 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Gabriel/bart-base-cnn-swe](https://huggingface.co/Gabriel/bart-base-cnn-swe) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.1140
-- Rouge1: 30.7101
-- Rouge2: 11.9532
-- Rougel: 25.1864
-- Rougelsum: 25.2227
-- Gen Len: 19.7448
 ## Model description
@@ -40,7 +40,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 3.75e-05
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
@@ -49,21 +49,22 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
-| 2.3087        | 1.0   | 6375  | 2.1997          | 29.7666 | 11.0222 | 24.2659 | 24.2915   | 19.7172 |
-| 2.0793        | 2.0   | 12750 | 2.1285          | 30.4447 | 11.7671 | 24.9238 | 24.9622   | 19.7051 |
-| 1.9186        | 3.0   | 19125 | 2.1140          | 30.7101 | 11.9532 | 25.1864 | 25.2227   | 19.7448 |
 ### Framework versions
-- Transformers 4.22.1
 - Pytorch 1.12.1+cu113
 - Datasets 2.5.1
 - Tokenizers 0.12.1

 This model is a fine-tuned version of [Gabriel/bart-base-cnn-swe](https://huggingface.co/Gabriel/bart-base-cnn-swe) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.1027
+- Rouge1: 30.9467
+- Rouge2: 12.2589
+- Rougel: 25.4487
+- Rougelsum: 25.4792
+- Gen Len: 19.7379
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 4e-05
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 4
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
+| 2.3076        | 1.0   | 6375  | 2.1986          | 29.7041 | 10.9883 | 24.2149 | 24.2406   | 19.7193 |
+| 2.0733        | 2.0   | 12750 | 2.1246          | 30.4521 | 11.8107 | 24.9519 | 24.9745   | 19.6592 |
+| 1.8933        | 3.0   | 19125 | 2.0989          | 30.9407 | 12.2682 | 25.4135 | 25.4378   | 19.7195 |
+| 1.777         | 4.0   | 25500 | 2.1027          | 30.9467 | 12.2589 | 25.4487 | 25.4792   | 19.7379 |
 ### Framework versions
+- Transformers 4.22.2
 - Pytorch 1.12.1+cu113
 - Datasets 2.5.1
 - Tokenizers 0.12.1