End of training

Files changed (4)

README.md CHANGED

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [allenai/longformer-base-4096](https://huggingface.co/allenai/longformer-base-4096) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 20.5838
+- Loss: 19.3114
 ## Model description
@@ -49,17 +49,17 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 21.7989       | 0.36  | 4    | 22.5208         |
-| 20.1605       | 0.71  | 8    | 22.5098         |
-| 21.2348       | 1.07  | 12   | 22.6825         |
-| 21.615        | 1.42  | 16   | 22.2299         |
-| 20.4123       | 1.78  | 20   | 22.0543         |
-| 20.8174       | 2.13  | 24   | 22.1491         |
-| 20.7756       | 2.49  | 28   | 21.9382         |
-| 19.8654       | 2.84  | 32   | 21.6735         |
-| 20.6743       | 3.2   | 36   | 21.3893         |
-| 20.3151       | 3.56  | 40   | 21.1435         |
-| 19.6614       | 3.91  | 44   | 20.5838         |
+| 20.339        | 0.36  | 4    | 20.7522         |
+| 18.817        | 0.71  | 8    | 20.7752         |
+| 19.8054       | 1.07  | 12   | 20.9602         |
+| 20.1845       | 1.42  | 16   | 20.5820         |
+| 19.0897       | 1.78  | 20   | 20.4297         |
+| 19.4773       | 2.13  | 24   | 20.5561         |
+| 19.448        | 2.49  | 28   | 20.3986         |
+| 18.6107       | 2.84  | 32   | 20.1962         |
+| 19.3799       | 3.2   | 36   | 20.0190         |
+| 19.0794       | 3.56  | 40   | 19.8159         |
+| 18.489        | 3.91  | 44   | 19.3114         |
 ### Framework versions

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:33d7a6ab97ae94c2ab33ae3d3ffdaf065ba7e61d85b6eb52a45f99be35e9f77b
+oid sha256:f460d7b59084037bdf348c26c8aca3ff3e61ea7a0fb8f142ab5c90d5d79994ff
 size 594941266

special_tokens_map.json CHANGED

@@ -9,7 +9,7 @@
     "rstrip": false,
     "single_word": false
   },
-  "pad_token": "<pad>",
+  "pad_token": "</s>",
   "sep_token": "</s>",
   "unk_token": "<unk>"
 }
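
After this change, `pad_token` and `sep_token` both resolve to `</s>`. As a rough illustration only, the sketch below flags roles that share a token string in a mapping shaped like `special_tokens_map.json`. The helper name `shared_special_tokens` is hypothetical, the sample mappings contain just the three keys visible in this diff (the real file has more), and the handling of dict-style entries through a `content` key is an assumption about the file layout.

```python
import json
from collections import defaultdict


def shared_special_tokens(token_map):
    """Return {token_string: [roles...]} for token strings used by more than one role."""
    roles_by_token = defaultdict(list)
    for role, value in token_map.items():
        # Entries may be plain strings or dicts; reading "content" is an assumption.
        if isinstance(value, dict):
            value = value.get("content")
        if isinstance(value, str):
            roles_by_token[value].append(role)
    return {tok: sorted(roles) for tok, roles in roles_by_token.items() if len(roles) > 1}


# Only the keys visible in the diff; not the full file contents.
before_commit = json.loads('{"pad_token": "<pad>", "sep_token": "</s>", "unk_token": "<unk>"}')
after_commit = json.loads('{"pad_token": "</s>", "sep_token": "</s>", "unk_token": "<unk>"}')

print(shared_special_tokens(before_commit))
print(shared_special_tokens(after_commit))
```

Whether sharing `</s>` between padding and separator roles is intended here is not stated in the commit; the check only makes the overlap visible.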

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:49aba76b9e4e90c8c3669d6397a822e705d137da816c65797e3b29b6eeb24a83
+oid sha256:6aac62db9fee002e6152d6fdc2ed8982411ae77a86726793ccde63b0fac1cd27
 size 4536
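
The two `.bin` entries in this commit are Git LFS pointer files, so only the `oid` changes while `size` stays the same. A minimal sketch, assuming the three-line pointer layout shown here, of how a downloaded file could be checked against its pointer. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the payload below is made-up sample data, not the contents of either file.

```python
import hashlib


def parse_lfs_pointer(text):
    """Split each 'key value' line of a Git LFS pointer into a dict entry."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(pointer_text, data):
    """Check that data has the size and SHA-256 digest recorded in the pointer."""
    fields = parse_lfs_pointer(pointer_text)
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algo!r}")
    return int(fields["size"]) == len(data) and hashlib.sha256(data).hexdigest() == digest


# Build a pointer for sample bytes so the example is self-contained.
payload = b"example weights"
pointer = (
    "version https://git-lfs.github.com/spec/v1\n"
    f"oid sha256:{hashlib.sha256(payload).hexdigest()}\n"
    f"size {len(payload)}\n"
)

print(matches_pointer(pointer, payload))
print(matches_pointer(pointer, payload + b"!"))
```

For a file the size of `pytorch_model.bin` (594941266 bytes), hashing in chunks from disk would be preferable to loading everything into memory; the in-memory version is kept here for brevity.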