Meziane commited on
Commit
f39b00f
1 Parent(s): 7e864d6

End of training

Browse files
README.md CHANGED
@@ -17,10 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 1.9979
21
- - Rouge1: 0.0400
22
- - Rouge2: 0.0255
23
- - Rougel: 0.0372
 
24
  - Exact Match: 0.0
25
 
26
  ## Model description
@@ -46,10 +47,20 @@ The following hyperparameters were used during training:
46
  - seed: 42
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
- - num_epochs: 1
50
 
51
  ### Training results
52
 
 
 
 
 
 
 
 
 
 
 
53
 
54
 
55
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 2.1456
21
+ - Rouge1: 0.0
22
+ - Rouge2: 0.0
23
+ - Rougel: 0.0
24
+ - Blue: 0.2691
25
  - Exact Match: 0.0
26
 
27
  ## Model description
 
47
  - seed: 42
48
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
  - lr_scheduler_type: linear
50
+ - num_epochs: 4
51
 
52
  ### Training results
53
 
54
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Blue | Exact Match |
55
+ |:-------------:|:------:|:----:|:---------------:|:------:|:------:|:------:|:------:|:-----------:|
56
+ | 2.6243 | 0.4545 | 20 | 2.2957 | 0.0 | 0.0 | 0.0 | 0.2691 | 0.0 |
57
+ | 2.5412 | 0.9091 | 40 | 2.2453 | 0.0 | 0.0 | 0.0 | 0.2691 | 0.0 |
58
+ | 2.4349 | 1.3636 | 60 | 2.2129 | 0.0 | 0.0 | 0.0 | 0.2691 | 0.0 |
59
+ | 2.4455 | 1.8182 | 80 | 2.1870 | 0.0 | 0.0 | 0.0 | 0.2691 | 0.0 |
60
+ | 2.4301 | 2.2727 | 100 | 2.1709 | 0.0 | 0.0 | 0.0 | 0.2691 | 0.0 |
61
+ | 2.3886 | 2.7273 | 120 | 2.1594 | 0.0 | 0.0 | 0.0 | 0.2691 | 0.0 |
62
+ | 2.34 | 3.1818 | 140 | 2.1510 | 0.0 | 0.0 | 0.0 | 0.2691 | 0.0 |
63
+ | 2.3932 | 3.6364 | 160 | 2.1456 | 0.0 | 0.0 | 0.0 | 0.2691 | 0.0 |
64
 
65
 
66
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7fc75d458a617f397d9a15ff6ecaeb57e222f0b5729c4a14f9b7d9f90f4ea75c
3
  size 242041896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:942b8380a95059e5520b150d6fdde1e14fdf315f7ba92c1588c646f045b0a633
3
  size 242041896
runs/Jul02_22-19-59_1f3e30d5b854/events.out.tfevents.1719958803.1f3e30d5b854.957.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1b5d6b47520e82839855b39313fd789d05fd5b4ac728636c16c8f6b7817674c7
3
+ size 11950
runs/Jul02_22-19-59_1f3e30d5b854/events.out.tfevents.1719959311.1f3e30d5b854.957.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5cf011a819d85288dee4b2c9d7601b7710968e8310204929b36affb85227a1f1
3
+ size 11978
tokenizer_config.json CHANGED
@@ -1,4 +1,5 @@
1
  {
 
2
  "added_tokens_decoder": {
3
  "0": {
4
  "content": "<pad>",
@@ -930,8 +931,10 @@
930
  "clean_up_tokenization_spaces": true,
931
  "eos_token": "</s>",
932
  "extra_ids": 100,
 
933
  "model_max_length": 512,
934
  "pad_token": "<pad>",
 
935
  "tokenizer_class": "T5Tokenizer",
936
  "unk_token": "<unk>"
937
  }
 
1
  {
2
+ "add_prefix_space": true,
3
  "added_tokens_decoder": {
4
  "0": {
5
  "content": "<pad>",
 
931
  "clean_up_tokenization_spaces": true,
932
  "eos_token": "</s>",
933
  "extra_ids": 100,
934
+ "legacy": true,
935
  "model_max_length": 512,
936
  "pad_token": "<pad>",
937
+ "sp_model_kwargs": {},
938
  "tokenizer_class": "T5Tokenizer",
939
  "unk_token": "<unk>"
940
  }
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ff910f0a33f85e4d9c87ccdb646b15346b61d4b0a47e9b4f962998e31a8efc97
3
  size 5240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:94571187c08a8cabc500062c46df1454ba7c1e0bd0ec65ecf854a8cca0b87a79
3
  size 5240