Ransaka committed on
Commit 4b208d2
1 Parent(s): df8999d

End of training

README.md CHANGED
@@ -17,8 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
  
  This model is a fine-tuned version of [Ransaka/sinhala-bert-medium-v2](https://huggingface.co/Ransaka/sinhala-bert-medium-v2) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.1118
- - Accuracy: 0.9736
+ - Loss: 0.1610
+ - Accuracy: 0.9692
  
  ## Model description
  
@@ -37,22 +37,24 @@ More information needed
  ### Training hyperparameters
  
  The following hyperparameters were used during training:
- - learning_rate: 2e-05
+ - learning_rate: 0.0002
  - train_batch_size: 16
  - eval_batch_size: 16
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 3
+ - num_epochs: 5
  
  ### Training results
  
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
- | No log | 0.7 | 100 | 0.1270 | 0.9561 |
- | No log | 1.4 | 200 | 0.1239 | 0.9649 |
- | No log | 2.1 | 300 | 0.1114 | 0.9719 |
- | No log | 2.8 | 400 | 0.1118 | 0.9736 |
+ | 0.279 | 0.78 | 100 | 0.2297 | 0.9441 |
+ | 0.1572 | 1.56 | 200 | 0.1215 | 0.9635 |
+ | 0.0823 | 2.34 | 300 | 0.1393 | 0.9635 |
+ | 0.0557 | 3.12 | 400 | 0.1206 | 0.9669 |
+ | 0.0209 | 3.91 | 500 | 0.1480 | 0.9692 |
+ | 0.0059 | 4.69 | 600 | 0.1610 | 0.9692 |
  
  
  ### Framework versions
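
The new training-results rows can be read more easily with a little arithmetic. The sketch below is not part of the commit: it copies the six `+` rows from the table above and derives two things, an estimate of optimizer steps per epoch from the last logged (epoch, step) pair, and the checkpoint with the lowest validation loss. The helper names `steps_per_epoch` and `lowest_val_loss` are hypothetical. The example-count estimate assumes a single device and no gradient accumulation, neither of which the model card states.

```python
# Rows copied from the "+" side of the training-results table:
# (training_loss, epoch, step, validation_loss, accuracy)
rows = [
    (0.2790, 0.78, 100, 0.2297, 0.9441),
    (0.1572, 1.56, 200, 0.1215, 0.9635),
    (0.0823, 2.34, 300, 0.1393, 0.9635),
    (0.0557, 3.12, 400, 0.1206, 0.9669),
    (0.0209, 3.91, 500, 0.1480, 0.9692),
    (0.0059, 4.69, 600, 0.1610, 0.9692),
]

TRAIN_BATCH_SIZE = 16  # train_batch_size from the hyperparameter list


def steps_per_epoch(rows):
    """Estimate steps per epoch from the last logged (epoch, step) pair."""
    _, epoch, step, _, _ = rows[-1]
    return step / epoch


def lowest_val_loss(rows):
    """Return the row with the smallest validation loss."""
    return min(rows, key=lambda r: r[3])


spe = round(steps_per_epoch(rows))          # 600 / 4.69 rounds to 128
approx_examples = spe * TRAIN_BATCH_SIZE    # upper bound under the stated assumptions
best = lowest_val_loss(rows)

print(f"~{spe} steps/epoch, at most ~{approx_examples} training examples")
print(f"lowest validation loss {best[3]} at step {best[2]}")
```

Under these assumptions, validation loss is lowest at step 400 (0.1206) and rises to 0.1610 by step 600 while training loss keeps falling to 0.0059, which is a pattern commonly associated with overfitting. Accuracy at step 600 (0.9692) is still the highest logged in this run, so which checkpoint is preferable depends on the metric that matters for the task.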
runs/Dec02_18-06-51_0c1b7c8a2656/events.out.tfevents.1701540412.0c1b7c8a2656.1555.6 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:46e090a672efb088511de33c61af024a63c82198ce714ce7004af5bf77cdf559
- size 6750
+ oid sha256:3a6fcd4d0f43f1dbb1180ebd55fc63a1e4b7007a86e57ff12c53898ac1024df5
+ size 7584
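
The TensorBoard event file is stored through Git LFS, so the diff above only shows the pointer file changing, not the log contents. As a minimal sketch (the function name `parse_lfs_pointer` is hypothetical and not part of any Git LFS tooling), the two pointers can be parsed as space-separated key/value lines to compare the recorded sizes:

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file made of 'key value' lines."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid line has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size in bytes of the real file
    }


old = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:46e090a672efb088511de33c61af024a63c82198ce714ce7004af5bf77cdf559
size 6750
""")

new = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:3a6fcd4d0f43f1dbb1180ebd55fc63a1e4b7007a86e57ff12c53898ac1024df5
size 7584
""")

growth = new["size"] - old["size"]
print(f"event file grew by {growth} bytes")  # 7584 - 6750 = 834
```

The event file grew by 834 bytes. That is consistent with the longer run logging more points, though the pointer alone does not show what was added.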