luluw committed on
Commit
f400590
1 Parent(s): e0f3d28

End of training

Files changed (2)
  1. README.md +23 -15
  2. model.safetensors +1 -1
README.md CHANGED
@@ -1,6 +1,6 @@
  ---
  license: apache-2.0
- base_model: luluw/t5-base-finetuned-billsum
+ base_model: google-t5/t5-base
  tags:
  - generated_from_trainer
  metrics:
@@ -15,14 +15,14 @@ should probably proofread and complete it, then remove this comment. -->
 
  # t5-base-finetuned-billsum
 
- This model is a fine-tuned version of [luluw/t5-base-finetuned-billsum](https://huggingface.co/luluw/t5-base-finetuned-billsum) on an unknown dataset.
+ This model is a fine-tuned version of [google-t5/t5-base](https://huggingface.co/google-t5/t5-base) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.5204
- - Rouge1: 48.9735
- - Rouge2: 29.0909
- - Rougel: 39.1634
- - Rougelsum: 42.7953
- - Gen Len: 112.7247
+ - Loss: 1.1725
+ - Rouge1: 54.1481
+ - Rouge2: 33.3953
+ - Rougel: 42.8337
+ - Rougelsum: 47.5287
+ - Gen Len: 116.8581
 
  ## Model description
 
@@ -42,22 +42,30 @@ More information needed
 
  The following hyperparameters were used during training:
  - learning_rate: 2e-05
- - train_batch_size: 8
- - eval_batch_size: 8
+ - train_batch_size: 16
+ - eval_batch_size: 16
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - lr_scheduler_warmup_steps: 1000
- - num_epochs: 3
+ - lr_scheduler_warmup_steps: 500
+ - num_epochs: 5
  - mixed_precision_training: Native AMP
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
  |:-------------:|:------:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:--------:|
- | 1.3003 | 0.8442 | 2000 | 1.1182 | 56.5942 | 37.4635 | 45.9359 | 50.3437 | 109.3659 |
- | 1.2443 | 1.6885 | 4000 | 1.1433 | 56.3579 | 36.706 | 45.4519 | 49.8982 | 118.3600 |
- | 1.5978 | 2.5327 | 6000 | 1.5204 | 48.9735 | 29.0909 | 39.1634 | 42.7953 | 112.7247 |
+ | 2.5944 | 0.4219 | 500 | 1.2582 | 50.6899 | 31.6418 | 40.2325 | 44.2687 | 111.7541 |
+ | 1.3588 | 0.8439 | 1000 | 1.1591 | 55.865 | 35.992 | 44.7636 | 49.2805 | 114.3552 |
+ | 1.275 | 1.2658 | 1500 | 1.1214 | 56.3449 | 37.0781 | 45.604 | 49.9711 | 110.7724 |
+ | 1.3266 | 1.6878 | 2000 | 1.1791 | 54.4797 | 33.8689 | 43.1813 | 47.8507 | 114.8278 |
+ | 1.3591 | 2.1097 | 2500 | 1.1725 | 54.243 | 33.5179 | 42.9187 | 47.6231 | 116.4601 |
+ | 1.3484 | 2.5316 | 3000 | 1.1724 | 54.1433 | 33.3914 | 42.8348 | 47.5267 | 116.7736 |
+ | 1.3467 | 2.9536 | 3500 | 1.1724 | 54.1359 | 33.3794 | 42.8167 | 47.5153 | 116.7819 |
+ | 1.3483 | 3.3755 | 4000 | 1.1724 | 54.1446 | 33.3947 | 42.8274 | 47.5313 | 116.8529 |
+ | 1.342 | 3.7975 | 4500 | 1.1724 | 54.1341 | 33.3888 | 42.8239 | 47.5291 | 116.7957 |
+ | 1.3475 | 4.2194 | 5000 | 1.1725 | 54.1411 | 33.3931 | 42.8224 | 47.5218 | 116.8229 |
+ | 1.3542 | 4.6414 | 5500 | 1.1725 | 54.1481 | 33.3953 | 42.8337 | 47.5287 | 116.8581 |
 
 
  ### Framework versions
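The updated README.md lists `learning_rate: 2e-05`, `lr_scheduler_type: linear`, and `lr_scheduler_warmup_steps: 500`. The plain-Python sketch below shows the learning-rate curve those settings would typically produce (linear warmup, then linear decay to zero). The function name `linear_warmup_lr` is illustrative, and the total of 5925 steps is an assumption inferred from the results table (step 500 at epoch 0.4219 suggests about 1185 steps per epoch over 5 epochs); the commit itself does not state it.

```python
def linear_warmup_lr(step, base_lr=2e-05, warmup_steps=500, total_steps=5925):
    """Learning rate at an optimizer step under linear warmup and linear decay.

    total_steps=5925 is an estimate (about 1185 steps/epoch * 5 epochs),
    not a value recorded in the commit.
    """
    if step < warmup_steps:
        # Ramp linearly from 0 up to base_lr over the warmup period.
        return base_lr * step / max(1, warmup_steps)
    # After warmup, decay linearly so the rate reaches 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Print the rate at a few of the evaluation steps from the results table.
    for step in (500, 1500, 3000, 5500):
        print(step, linear_warmup_lr(step))
```

Under these assumptions the rate peaks at step 500 and is only a small fraction of its peak by the last logged step (5500).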
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:79d88be4a2f9320b4fed7073b05ca0dd27ee53505c34170066ec66e92894b344
+ oid sha256:82a1bbbe67948726b2cbeb41a61f6fe96cf96edc2d9b61b3806425534e3ac46c
  size 891644712
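The model.safetensors change touches only the Git LFS pointer: the `oid` changes while `size` stays at 891644712. A downloaded weights file can be checked against such a pointer with a short script. This is a minimal sketch using only the standard library; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of Git LFS or any Hugging Face tooling.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a dict of its 'key value' lines."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path, pointer_text, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and SHA-256 the pointer names."""
    fields = parse_lfs_pointer(pointer_text)
    algo, _, expected = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo!r}")
    digest = hashlib.sha256()
    size = 0
    # Hash in chunks so a file of several hundred MB is never fully in memory.
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
            size += len(chunk)
    return size == int(fields["size"]) and digest.hexdigest() == expected
```

Calling `matches_pointer("model.safetensors", pointer_text)` with the three pointer lines from the `+` side of the diff would indicate whether a local copy corresponds to this commit rather than its parent.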