dimboump commited on
Commit
d7cc385
1 Parent(s): c45899c

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +82 -0
README.md ADDED
@@ -0,0 +1,82 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - generated_from_trainer
5
+ datasets:
6
+ - glue
7
+ metrics:
8
+ - f1
9
+ - accuracy
10
+ model-index:
11
+ - name: glue_sst_classifier
12
+ results:
13
+ - task:
14
+ name: Text Classification
15
+ type: text-classification
16
+ dataset:
17
+ name: glue
18
+ type: glue
19
+ args: sst2
20
+ metrics:
21
+ - name: F1
22
+ type: f1
23
+ value: 0.901123595505618
24
+ - name: Accuracy
25
+ type: accuracy
26
+ value: 0.8990825688073395
27
+ ---
28
+
29
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
30
+ should probably proofread and complete it, then remove this comment. -->
31
+
32
+ # glue_sst_classifier
33
+
34
+ This model is a fine-tuned version of [bert-base-cased](https://huggingface.co/bert-base-cased) on the glue dataset.
35
+ It achieves the following results on the evaluation set:
36
+ - Loss: 0.2371
37
+ - F1: 0.9011
38
+ - Accuracy: 0.8991
39
+
40
+ ## Model description
41
+
42
+ More information needed
43
+
44
+ ## Intended uses & limitations
45
+
46
+ More information needed
47
+
48
+ ## Training and evaluation data
49
+
50
+ More information needed
51
+
52
+ ## Training procedure
53
+
54
+ ### Training hyperparameters
55
+
56
+ The following hyperparameters were used during training:
57
+ - learning_rate: 1e-05
58
+ - train_batch_size: 128
59
+ - eval_batch_size: 128
60
+ - seed: 42
61
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
62
+ - lr_scheduler_type: linear
63
+ - lr_scheduler_warmup_ratio: 0.1
64
+ - num_epochs: 1.0
65
+
66
+ ### Training results
67
+
68
+ | Training Loss | Epoch | Step | Validation Loss | F1 | Accuracy |
69
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:--------:|
70
+ | 0.3653 | 0.19 | 100 | 0.3213 | 0.8717 | 0.8727 |
71
+ | 0.291 | 0.38 | 200 | 0.2662 | 0.8936 | 0.8911 |
72
+ | 0.2239 | 0.57 | 300 | 0.2417 | 0.9081 | 0.9060 |
73
+ | 0.2306 | 0.76 | 400 | 0.2359 | 0.9105 | 0.9094 |
74
+ | 0.2185 | 0.95 | 500 | 0.2371 | 0.9011 | 0.8991 |
75
+
76
+
77
+ ### Framework versions
78
+
79
+ - Transformers 4.18.0
80
+ - Pytorch 1.11.0+cu113
81
+ - Datasets 2.1.0
82
+ - Tokenizers 0.12.1