alexpaunoiu
/

bert_key_extractor_finetune

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3466
 ## Model description
@@ -43,22 +43,24 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 100
-- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.1603        | 0.99  | 14   | 1.9269          |
-| 0.6531        | 1.98  | 28   | 0.5997          |
-| 0.5296        | 2.97  | 42   | 0.5190          |
-| 0.4884        | 3.96  | 56   | 0.4728          |
-| 0.4394        | 4.96  | 70   | 0.4398          |
-| 0.4299        | 5.95  | 84   | 0.4044          |
-| 0.3906        | 6.94  | 98   | 0.3802          |
-| 0.381         | 8.0   | 113  | 0.3635          |
-| 0.3642        | 8.99  | 127  | 0.3545          |
-| 0.3569        | 9.91  | 140  | 0.3466          |
 ### Framework versions

 This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.0069
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 100
+- num_epochs: 12
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 11.322        | 0.99  | 14   | 10.9937         |
+| 9.3954        | 1.98  | 28   | 8.8487          |
+| 6.5458        | 2.97  | 42   | 5.9401          |
+| 5.1036        | 3.96  | 56   | 4.6196          |
+| 3.8587        | 4.96  | 70   | 3.4009          |
+| 2.7987        | 5.95  | 84   | 2.7571          |
+| 2.6306        | 6.94  | 98   | 2.5074          |
+| 2.3636        | 8.0   | 113  | 2.3132          |
+| 2.2169        | 8.99  | 127  | 2.2248          |
+| 2.1732        | 9.98  | 141  | 2.1092          |
+| 2.0377        | 10.97 | 155  | 2.0351          |
+| 1.9973        | 11.89 | 168  | 2.0069          |
 ### Framework versions

config.json CHANGED Viewed

@@ -71,5 +71,5 @@
   "torch_dtype": "float32",
   "transformers_version": "4.34.1",
   "use_cache": true,
-  "vocab_size": 75113
 }

   "torch_dtype": "float32",
   "transformers_version": "4.34.1",
   "use_cache": true,
+  "vocab_size": 75112
 }

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e6cfaa00860a5ec37f71a6e9fb1e6df1632d0fa92da746d3306deba0c990dd97
-size 634403677

 version https://git-lfs.github.com/spec/v1
+oid sha256:cf98856a6adbce02467c2d4c158ae1a485773fc3136f6e3fce4184022a6c173c
+size 634400605

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:43f2632d7e32626513429e48e06bd40965c07607f8a9e1881f9ed15fe60d9b21
 size 4283

 version https://git-lfs.github.com/spec/v1
+oid sha256:6da1e295eb1209757d7a9fdcb78bb2abcfb5b870204d750d48b3fc5be1b70210
 size 4283