Shakhovak
/

llama-7b-absa-restaurants

Shakhovak commited on Apr 20

Commit

3307175

•

1 Parent(s): d775053

End of training

Files changed (3) hide show

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [baffo32/decapoda-research-llama-7B-hf](https://huggingface.co/baffo32/decapoda-research-llama-7B-hf) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0444
 ## Model description
@@ -43,25 +43,28 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
-- training_steps: 500
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.1289        | 0.36  | 40   | 0.0339          |
-| 0.0345        | 0.72  | 80   | 0.0300          |
-| 0.0304        | 1.08  | 120  | 0.0256          |
-| 0.0198        | 1.44  | 160  | 0.0261          |
-| 0.022         | 1.8   | 200  | 0.0249          |
-| 0.0157        | 2.16  | 240  | 0.0286          |
-| 0.0115        | 2.52  | 280  | 0.0279          |
-| 0.011         | 2.88  | 320  | 0.0295          |
-| 0.0066        | 3.24  | 360  | 0.0372          |
-| 0.005         | 3.6   | 400  | 0.0362          |
-| 0.0036        | 3.96  | 440  | 0.0423          |
-| 0.0018        | 4.32  | 480  | 0.0444          |
 ### Framework versions

 This model is a fine-tuned version of [baffo32/decapoda-research-llama-7B-hf](https://huggingface.co/baffo32/decapoda-research-llama-7B-hf) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0455
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
+- training_steps: 600
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.1265        | 0.36  | 40   | 0.0377          |
+| 0.0356        | 0.72  | 80   | 0.0319          |
+| 0.0292        | 1.08  | 120  | 0.0276          |
+| 0.0199        | 1.44  | 160  | 0.0264          |
+| 0.0211        | 1.8   | 200  | 0.0277          |
+| 0.0162        | 2.16  | 240  | 0.0306          |
+| 0.012         | 2.52  | 280  | 0.0280          |
+| 0.0121        | 2.88  | 320  | 0.0301          |
+| 0.0075        | 3.24  | 360  | 0.0370          |
+| 0.0052        | 3.6   | 400  | 0.0379          |
+| 0.0058        | 3.96  | 440  | 0.0336          |
+| 0.0026        | 4.32  | 480  | 0.0484          |
+| 0.0016        | 4.68  | 520  | 0.0455          |
+| 0.0021        | 5.05  | 560  | 0.0439          |
+| 0.0008        | 5.41  | 600  | 0.0455          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d4e98911a51b9526afb2d2e200948ffb77f431836cb977929dc7c99750fbc30a
 size 268528394

 version https://git-lfs.github.com/spec/v1
+oid sha256:f392f82d7c267f87c4cb77406f00fb45361a76971532bc7b737a3e114c42edf7
 size 268528394

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8944be6d69028282994545af3394230058c8ed222c5a2f61d0186e92dd5263c2
 size 4984

 version https://git-lfs.github.com/spec/v1
+oid sha256:34117e73398806caa00c429d63f42be605e6b2f7eb10f4e6751733e23a819385
 size 4984