hung200504
/

bert-30

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [deepset/bert-base-cased-squad2](https://huggingface.co/deepset/bert-base-cased-squad2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 10.2691
 ## Model description
@@ -40,67 +40,45 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 11.3132       | 0.09  | 5    | 12.3055         |
-| 11.4109       | 0.18  | 10   | 12.2292         |
-| 10.9744       | 0.27  | 15   | 12.1547         |
-| 11.0771       | 0.36  | 20   | 12.0814         |
-| 11.0342       | 0.45  | 25   | 12.0101         |
-| 11.0327       | 0.55  | 30   | 11.9396         |
-| 10.2954       | 0.64  | 35   | 11.8706         |
-| 10.8979       | 0.73  | 40   | 11.8043         |
-| 10.432        | 0.82  | 45   | 11.7386         |
-| 10.3023       | 0.91  | 50   | 11.6747         |
-| 10.0494       | 1.0   | 55   | 11.6128         |
-| 10.2273       | 1.09  | 60   | 11.5521         |
-| 10.3139       | 1.18  | 65   | 11.4931         |
-| 10.5075       | 1.27  | 70   | 11.4349         |
-| 10.0234       | 1.36  | 75   | 11.3790         |
-| 10.4276       | 1.45  | 80   | 11.3238         |
-| 10.1397       | 1.55  | 85   | 11.2699         |
-| 10.0675       | 1.64  | 90   | 11.2174         |
-| 9.8835        | 1.73  | 95   | 11.1665         |
-| 10.0738       | 1.82  | 100  | 11.1169         |
-| 9.6112        | 1.91  | 105  | 11.0687         |
-| 9.9186        | 2.0   | 110  | 11.0227         |
-| 9.8411        | 2.09  | 115  | 10.9779         |
-| 9.6506        | 2.18  | 120  | 10.9342         |
-| 9.7831        | 2.27  | 125  | 10.8916         |
-| 9.8835        | 2.36  | 130  | 10.8509         |
-| 9.4752        | 2.45  | 135  | 10.8111         |
-| 9.8176        | 2.55  | 140  | 10.7731         |
-| 9.3628        | 2.64  | 145  | 10.7369         |
-| 9.819         | 2.73  | 150  | 10.7017         |
-| 9.572         | 2.82  | 155  | 10.6681         |
-| 9.522         | 2.91  | 160  | 10.6356         |
-| 9.6874        | 3.0   | 165  | 10.6046         |
-| 9.6037        | 3.09  | 170  | 10.5750         |
-| 9.5624        | 3.18  | 175  | 10.5468         |
-| 9.2702        | 3.27  | 180  | 10.5202         |
-| 9.1347        | 3.36  | 185  | 10.4947         |
-| 9.8154        | 3.45  | 190  | 10.4706         |
-| 9.4045        | 3.55  | 195  | 10.4475         |
-| 9.2453        | 3.64  | 200  | 10.4262         |
-| 9.1087        | 3.73  | 205  | 10.4062         |
-| 8.985         | 3.82  | 210  | 10.3875         |
-| 9.0054        | 3.91  | 215  | 10.3705         |
-| 9.4764        | 4.0   | 220  | 10.3545         |
-| 9.13          | 4.09  | 225  | 10.3401         |
-| 9.4397        | 4.18  | 230  | 10.3272         |
-| 9.0841        | 4.27  | 235  | 10.3153         |
-| 9.5885        | 4.36  | 240  | 10.3048         |
-| 9.4137        | 4.45  | 245  | 10.2958         |
-| 9.1068        | 4.55  | 250  | 10.2878         |
-| 9.1388        | 4.64  | 255  | 10.2816         |
-| 8.8014        | 4.73  | 260  | 10.2763         |
-| 8.9782        | 4.82  | 265  | 10.2727         |
-| 9.222         | 4.91  | 270  | 10.2701         |
-| 9.292         | 5.0   | 275  | 10.2691         |
 ### Framework versions

 This model is a fine-tuned version of [deepset/bert-base-cased-squad2](https://huggingface.co/deepset/bert-base-cased-squad2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 11.0923
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 11.3132       | 0.09  | 5    | 12.3056         |
+| 11.4114       | 0.18  | 10   | 12.2307         |
+| 10.9764       | 0.27  | 15   | 12.1584         |
+| 11.0818       | 0.36  | 20   | 12.0885         |
+| 11.0423       | 0.45  | 25   | 12.0202         |
+| 11.0449       | 0.55  | 30   | 11.9548         |
+| 10.3108       | 0.64  | 35   | 11.8919         |
+| 10.9207       | 0.73  | 40   | 11.8320         |
+| 10.4606       | 0.82  | 45   | 11.7743         |
+| 10.3376       | 0.91  | 50   | 11.7188         |
+| 10.091        | 1.0   | 55   | 11.6658         |
+| 10.2796       | 1.09  | 60   | 11.6154         |
+| 10.3783       | 1.18  | 65   | 11.5677         |
+| 10.5827       | 1.27  | 70   | 11.5216         |
+| 10.1084       | 1.36  | 75   | 11.4785         |
+| 10.5279       | 1.45  | 80   | 11.4372         |
+| 10.2497       | 1.55  | 85   | 11.3984         |
+| 10.1908       | 1.64  | 90   | 11.3618         |
+| 10.0181       | 1.73  | 95   | 11.3275         |
+| 10.2313       | 1.82  | 100  | 11.2956         |
+| 9.7781        | 1.91  | 105  | 11.2663         |
+| 10.1123       | 2.0   | 110  | 11.2391         |
+| 10.0491       | 2.09  | 115  | 11.2141         |
+| 9.869         | 2.18  | 120  | 11.1916         |
+| 10.0292       | 2.27  | 125  | 11.1714         |
+| 10.1515       | 2.36  | 130  | 11.1534         |
+| 9.7539        | 2.45  | 135  | 11.1377         |
+| 10.1323       | 2.55  | 140  | 11.1242         |
+| 9.6956        | 2.64  | 145  | 11.1136         |
+| 10.1937       | 2.73  | 150  | 11.1050         |
+| 9.9615        | 2.82  | 155  | 11.0983         |
+| 9.9249        | 2.91  | 160  | 11.0940         |
+| 10.1271       | 3.0   | 165  | 11.0923         |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f90d3c49d1b385da5225959f1983305f7884366c67740854b1f35b84235b33e4
 size 430953062

 version https://git-lfs.github.com/spec/v1
+oid sha256:76f8db24159f21692b0feb6bbb56eb12b732e394cd378337c71c143119dbcaee
 size 430953062

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:990544e7dd405cd14477e1f4cadc28632bf8de56bc17e198986794ccb67011ef
 size 4472

 version https://git-lfs.github.com/spec/v1
+oid sha256:be1f1739ca19b12ae4954ca2293c0ba9612dbedb3757bbb9e1b064d30756b5bf
 size 4472