End of training

Files changed (4)

README.md +15 -22
model.safetensors +1 -1
runs/Feb25_02-11-23_default/events.out.tfevents.1708827100.default.93.2 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.6657
-- Accuracy: 0.7257
-- Precision: 0.6860
-- Recall: 0.6797
-- F1: 0.6819
 ## Model description
@@ -44,32 +44,25 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 2432
 - num_epochs: 999
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
-| 0.6095        | 1.0   | 381  | 0.7471          | 0.6649   | 0.6231    | 0.5890 | 0.5836 |
-| 0.8797        | 2.0   | 762  | 0.6495          | 0.7156   | 0.6775    | 0.6794 | 0.6769 |
-| 1.5041        | 3.0   | 1143 | 0.6235          | 0.7182   | 0.6808    | 0.6875 | 0.6820 |
-| 0.5179        | 4.0   | 1524 | 0.6518          | 0.7320   | 0.6925    | 0.6786 | 0.6831 |
-| 0.3651        | 5.0   | 1905 | 0.7282          | 0.7336   | 0.6960    | 0.6720 | 0.6775 |
-| 0.2562        | 6.0   | 2286 | 0.8677          | 0.7159   | 0.6941    | 0.6323 | 0.6299 |
-| 0.0076        | 7.0   | 2667 | 0.9973          | 0.7093   | 0.6627    | 0.6564 | 0.6587 |
-| 0.0017        | 8.0   | 3048 | 1.0098          | 0.7277   | 0.6878    | 0.6763 | 0.6803 |
-| 0.0018        | 9.0   | 3429 | 1.0632          | 0.7257   | 0.6886    | 0.6857 | 0.6857 |
-| 0.012         | 10.0  | 3810 | 1.3255          | 0.7277   | 0.6877    | 0.6811 | 0.6836 |
-| 0.0396        | 11.0  | 4191 | 1.5134          | 0.7103   | 0.6800    | 0.6772 | 0.6743 |
-| 0.0027        | 12.0  | 4572 | 1.6125          | 0.7054   | 0.6675    | 0.6317 | 0.6356 |
-| 0.0005        | 13.0  | 4953 | 1.4959          | 0.7228   | 0.6847    | 0.6646 | 0.6693 |
-| 0.0033        | 14.0  | 5334 | 1.6657          | 0.7257   | 0.6860    | 0.6797 | 0.6819 |
 ### Framework versions

 This model is a fine-tuned version of [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.5117
+- Accuracy: 0.7169
+- Precision: 0.6803
+- Recall: 0.6642
+- F1: 0.6693
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 999
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
+| 0.7606        | 1.0   | 761  | 0.6333          | 0.7228   | 0.6856    | 0.6831 | 0.6819 |
+| 0.4448        | 2.0   | 1522 | 0.6450          | 0.7320   | 0.6983    | 0.7065 | 0.7011 |
+| 1.0076        | 3.0   | 2283 | 0.6573          | 0.7346   | 0.7038    | 0.7153 | 0.7069 |
+| 0.1369        | 4.0   | 3044 | 0.8941          | 0.7248   | 0.6855    | 0.6762 | 0.6796 |
+| 0.0096        | 5.0   | 3805 | 1.1590          | 0.7264   | 0.6874    | 0.6911 | 0.6889 |
+| 0.0728        | 6.0   | 4566 | 1.2896          | 0.7366   | 0.7001    | 0.6875 | 0.6910 |
+| 0.0007        | 7.0   | 5327 | 1.5882          | 0.7297   | 0.7027    | 0.6787 | 0.6825 |
+| 0.0106        | 8.0   | 6088 | 1.5117          | 0.7169   | 0.6803    | 0.6642 | 0.6693 |
 ### Framework versions
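
Note on the results above: the headline metrics in the new card (Loss 1.5117, Accuracy 0.7169, F1 0.6693) are those of the last logged row (epoch 8), not the best row in the table. A minimal sketch of how one might pick the best epoch from the logged rows follows. The `rows` list is copied by hand from the new-side table, and the tuple layout is an assumption made for illustration; it is not part of the training script.

```python
# Rows copied from the new (train_batch_size: 16) results table in this commit.
# Tuple layout (assumed for this sketch): (epoch, validation_loss, accuracy, f1)
rows = [
    (1, 0.6333, 0.7228, 0.6819),
    (2, 0.6450, 0.7320, 0.7011),
    (3, 0.6573, 0.7346, 0.7069),
    (4, 0.8941, 0.7248, 0.6796),
    (5, 1.1590, 0.7264, 0.6889),
    (6, 1.2896, 0.7366, 0.6910),
    (7, 1.5882, 0.7297, 0.6825),
    (8, 1.5117, 0.7169, 0.6693),
]

# Lowest validation loss and highest F1 can point at different epochs.
best_by_loss = min(rows, key=lambda r: r[1])
best_by_f1 = max(rows, key=lambda r: r[3])
final = rows[-1]

print("best by validation loss: epoch", best_by_loss[0])
print("best by F1:              epoch", best_by_f1[0])
print("reported in the card:    epoch", final[0])
```

Validation loss rises from epoch 4 onward while training loss falls toward zero, which is the usual overfitting pattern. Whether this run used `load_best_model_at_end` or any early stopping is not stated in the commit, so the uploaded weights may correspond to the final epoch.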

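The step counts in the two tables are consistent with the batch-size change: 381 steps per epoch at batch size 32 versus 761 at batch size 16. The sketch below bounds the training-set size implied by each figure. It assumes a single device, no gradient accumulation, and that the last partial batch is kept; none of these are confirmed by the commit, and `example_count_range` is a hypothetical helper.

```python
import math


def example_count_range(steps_per_epoch, batch_size):
    """Inclusive bounds on N such that ceil(N / batch_size) == steps_per_epoch.

    Assumes one optimizer step per batch and a kept final partial batch.
    """
    low = (steps_per_epoch - 1) * batch_size + 1
    high = steps_per_epoch * batch_size
    return low, high


old = example_count_range(381, 32)  # old run: train_batch_size 32
new = example_count_range(761, 16)  # new run: train_batch_size 16

# A training-set size must satisfy both runs, so intersect the two ranges.
overlap = (max(old[0], new[0]), min(old[1], new[1]))
print(old, new, overlap)

# Sanity check with one value inside the overlap.
n = overlap[0]
assert math.ceil(n / 32) == 381 and math.ceil(n / 16) == 761
```

Under those assumptions both runs imply the same training set of roughly 12.2k examples, so the two tables differ in steps per epoch only because of the batch size.
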
model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:870fd801c98103194261a83ed68b56d0569868b9486c387fc06cd7b9f49c930e
 size 541320452

 version https://git-lfs.github.com/spec/v1
+oid sha256:3046e4e9bb36f9b6f646917250c2a059b4542a5e93252528d7833c951a2e6f16
 size 541320452

runs/Feb25_02-11-23_default/events.out.tfevents.1708827100.default.93.2 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:54e05a43254e3d7b0b887048cac6c4214969d4f2c0654239af94ba85cef4fb56
+size 964389

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f165d3527b2185fc3e4888a23c38278edad2b4364b00e629a26915f700970642
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:54e15fdeba6702e134d817ee9d689c9e2d08c552723ca7abe4b470c18cf16989
 size 4728
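
The three binary files in this commit are stored as Git LFS pointers, so the diff shows only a `version` line, a `sha256` object ID, and a byte size. For `model.safetensors` and `training_args.bin` the size is unchanged while the object ID differs, meaning the content changed but its length did not. A minimal sketch of parsing such a pointer and checking a downloaded file against it follows; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of any library.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into version, hash algorithm, digest, size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Return True if the bytes have the size and SHA-256 digest the pointer records."""
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# Pointer text copied from the new side of training_args.bin in this commit.
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:54e15fdeba6702e134d817ee9d689c9e2d08c552723ca7abe4b470c18cf16989
size 4728
"""
pointer = parse_lfs_pointer(pointer_text)
print(pointer["algo"], pointer["size"])
```

For a file as large as `model.safetensors` (541320452 bytes), one would normally hash in chunks instead of reading the whole file into memory; the in-memory version here is kept short for illustration.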