lukasdrg commited on
Commit
831c2e8
1 Parent(s): f7bbab3

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [allenai/longformer-base-4096](https://huggingface.co/allenai/longformer-base-4096) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 19.3114
19
 
20
  ## Model description
21
 
@@ -49,17 +49,17 @@ The following hyperparameters were used during training:
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
- | 20.339 | 0.36 | 4 | 20.7522 |
53
- | 18.817 | 0.71 | 8 | 20.7752 |
54
- | 19.8054 | 1.07 | 12 | 20.9602 |
55
- | 20.1845 | 1.42 | 16 | 20.5820 |
56
- | 19.0897 | 1.78 | 20 | 20.4297 |
57
- | 19.4773 | 2.13 | 24 | 20.5561 |
58
- | 19.448 | 2.49 | 28 | 20.3986 |
59
- | 18.6107 | 2.84 | 32 | 20.1962 |
60
- | 19.3799 | 3.2 | 36 | 20.0190 |
61
- | 19.0794 | 3.56 | 40 | 19.8159 |
62
- | 18.489 | 3.91 | 44 | 19.3114 |
63
 
64
 
65
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [allenai/longformer-base-4096](https://huggingface.co/allenai/longformer-base-4096) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 3.7059
19
 
20
  ## Model description
21
 
 
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
+ | 4.4747 | 0.36 | 4 | 4.0650 |
53
+ | 3.6551 | 0.71 | 8 | 4.6086 |
54
+ | 3.6023 | 1.07 | 12 | 3.2223 |
55
+ | 4.1356 | 1.42 | 16 | 4.3734 |
56
+ | 3.5103 | 1.78 | 20 | 3.4911 |
57
+ | 3.8186 | 2.13 | 24 | 4.5828 |
58
+ | 3.5699 | 2.49 | 28 | 3.9042 |
59
+ | 3.9307 | 2.84 | 32 | 3.8804 |
60
+ | 3.4662 | 3.2 | 36 | 3.8901 |
61
+ | 4.0907 | 3.56 | 40 | 4.0173 |
62
+ | 3.7467 | 3.91 | 44 | 3.7059 |
63
 
64
 
65
  ### Framework versions
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f460d7b59084037bdf348c26c8aca3ff3e61ea7a0fb8f142ab5c90d5d79994ff
3
  size 594941266
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:265c28cd1e5c1e744079d8c3e34fdcdb6d10326dcb723b6817e2a8af2ee68755
3
  size 594941266
special_tokens_map.json CHANGED
@@ -9,7 +9,7 @@
9
  "rstrip": false,
10
  "single_word": false
11
  },
12
- "pad_token": "</s>",
13
  "sep_token": "</s>",
14
  "unk_token": "<unk>"
15
  }
 
9
  "rstrip": false,
10
  "single_word": false
11
  },
12
+ "pad_token": "<pad>",
13
  "sep_token": "</s>",
14
  "unk_token": "<unk>"
15
  }
tokenizer_config.json CHANGED
@@ -49,7 +49,7 @@
49
  "errors": "replace",
50
  "mask_token": "<mask>",
51
  "model_max_length": 4096,
52
- "pad_token": "</s>",
53
  "sep_token": "</s>",
54
  "tokenizer_class": "LongformerTokenizer",
55
  "trim_offsets": true,
 
49
  "errors": "replace",
50
  "mask_token": "<mask>",
51
  "model_max_length": 4096,
52
+ "pad_token": "<pad>",
53
  "sep_token": "</s>",
54
  "tokenizer_class": "LongformerTokenizer",
55
  "trim_offsets": true,
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:6aac62db9fee002e6152d6fdc2ed8982411ae77a86726793ccde63b0fac1cd27
3
  size 4536
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a767253255c6344443e8ccbc9aeae5206ac26c39ffe44b82ff14c024665eb89e
3
  size 4536