vedantjumle
/

bert-2

@@ -15,10 +15,10 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bert-large-uncased](https://huggingface.co/bert-large-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 0.0645
-- Validation Loss: 0.5256
-- Train Accuracy: 0.89
-- Epoch: 29
 ## Model description
@@ -44,36 +44,7 @@ The following hyperparameters were used during training:
 | Train Loss | Validation Loss | Train Accuracy | Epoch |
 |:----------:|:---------------:|:--------------:|:-----:|
-| 5.0926     | 4.9378          | 0.0267         | 0     |
-| 4.8091     | 4.5518          | 0.19           | 1     |
-| 4.2565     | 3.9529          | 0.4833         | 2     |
-| 3.5496     | 3.3870          | 0.6733         | 3     |
-| 2.9320     | 2.8781          | 0.76           | 4     |
-| 2.3851     | 2.4109          | 0.8233         | 5     |
-| 1.8879     | 2.0233          | 0.8633         | 6     |
-| 1.4473     | 1.6912          | 0.8667         | 7     |
-| 1.1197     | 1.4349          | 0.8867         | 8     |
-| 0.8654     | 1.2351          | 0.8767         | 9     |
-| 0.6786     | 1.1051          | 0.8933         | 10    |
-| 0.5352     | 0.9739          | 0.8967         | 11    |
-| 0.4209     | 0.8814          | 0.8967         | 12    |
-| 0.3455     | 0.8079          | 0.8933         | 13    |
-| 0.2878     | 0.7635          | 0.8967         | 14    |
-| 0.2364     | 0.7142          | 0.9            | 15    |
-| 0.2115     | 0.6899          | 0.9            | 16    |
-| 0.1827     | 0.6554          | 0.9067         | 17    |
-| 0.1625     | 0.6388          | 0.8967         | 18    |
-| 0.1422     | 0.6158          | 0.9033         | 19    |
-| 0.1321     | 0.6181          | 0.9            | 20    |
-| 0.1187     | 0.5910          | 0.8933         | 21    |
-| 0.1072     | 0.5873          | 0.8967         | 22    |
-| 0.0988     | 0.5725          | 0.8967         | 23    |
-| 0.0902     | 0.5619          | 0.8967         | 24    |
-| 0.0844     | 0.5506          | 0.8967         | 25    |
-| 0.0776     | 0.5489          | 0.89           | 26    |
-| 0.0724     | 0.5411          | 0.8933         | 27    |
-| 0.0687     | 0.5280          | 0.8933         | 28    |
-| 0.0645     | 0.5256          | 0.89           | 29    |
 ### Framework versions

 This model is a fine-tuned version of [bert-large-uncased](https://huggingface.co/bert-large-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 5.1229
+- Validation Loss: 5.0343
+- Train Accuracy: 0.0067
+- Epoch: 0
 ## Model description
 | Train Loss | Validation Loss | Train Accuracy | Epoch |
 |:----------:|:---------------:|:--------------:|:-----:|
+| 5.1229     | 5.0343          | 0.0067         | 0     |
 ### Framework versions

config.json CHANGED Viewed

@@ -7,7 +7,7 @@
   "classifier_dropout": 0.5,
   "gradient_checkpointing": false,
   "hidden_act": "gelu",
-  "hidden_dropout_prob": 0.0,
   "hidden_size": 1024,
   "id2label": {
     "0": "LABEL_0",

   "classifier_dropout": 0.5,
   "gradient_checkpointing": false,
   "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.4,
   "hidden_size": 1024,
   "id2label": {
     "0": "LABEL_0",

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4355bca940c806ec4db38fce331c7a04def5a7f20c48b8e88b34d1f4b1e93246
 size 1341734528

 version https://git-lfs.github.com/spec/v1
+oid sha256:ceb1cb66d5c12908f681272d87fa414f0da25747c3d3938af4c0d693866738fe
 size 1341734528