Shakhovak
/

llama-7b-absa-restaurants

Generated from Trainer

Model card Files Files and versions Community

Shakhovak commited on Apr 21

Commit

1422322

•

1 Parent(s): 3307175

End of training

Browse files

Files changed (3) hide show

README.md +12 -17
adapter_model.bin +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [baffo32/decapoda-research-llama-7B-hf](https://huggingface.co/baffo32/decapoda-research-llama-7B-hf) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0455
 ## Model description
@@ -43,28 +43,23 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
-- training_steps: 600
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.1265        | 0.36  | 40   | 0.0377          |
-| 0.0356        | 0.72  | 80   | 0.0319          |
-| 0.0292        | 1.08  | 120  | 0.0276          |
-| 0.0199        | 1.44  | 160  | 0.0264          |
-| 0.0211        | 1.8   | 200  | 0.0277          |
-| 0.0162        | 2.16  | 240  | 0.0306          |
-| 0.012         | 2.52  | 280  | 0.0280          |
-| 0.0121        | 2.88  | 320  | 0.0301          |
-| 0.0075        | 3.24  | 360  | 0.0370          |
-| 0.0052        | 3.6   | 400  | 0.0379          |
-| 0.0058        | 3.96  | 440  | 0.0336          |
-| 0.0026        | 4.32  | 480  | 0.0484          |
-| 0.0016        | 4.68  | 520  | 0.0455          |
-| 0.0021        | 5.05  | 560  | 0.0439          |
-| 0.0008        | 5.41  | 600  | 0.0455          |
 ### Framework versions

 This model is a fine-tuned version of [baffo32/decapoda-research-llama-7B-hf](https://huggingface.co/baffo32/decapoda-research-llama-7B-hf) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0345
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
+- training_steps: 400
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.1281        | 0.36  | 40   | 0.0380          |
+| 0.0358        | 0.72  | 80   | 0.0314          |
+| 0.0296        | 1.08  | 120  | 0.0263          |
+| 0.0211        | 1.44  | 160  | 0.0254          |
+| 0.0203        | 1.8   | 200  | 0.0236          |
+| 0.0163        | 2.16  | 240  | 0.0273          |
+| 0.0115        | 2.52  | 280  | 0.0276          |
+| 0.0105        | 2.88  | 320  | 0.0265          |
+| 0.0081        | 3.24  | 360  | 0.0306          |
+| 0.0046        | 3.6   | 400  | 0.0345          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f392f82d7c267f87c4cb77406f00fb45361a76971532bc7b737a3e114c42edf7
 size 268528394

 version https://git-lfs.github.com/spec/v1
+oid sha256:cdd1a0f65a71b0f3400daa9be46ae6777b39d3b084d7f83c657919f9fba4a6dd
 size 268528394

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:34117e73398806caa00c429d63f42be605e6b2f7eb10f4e6751733e23a819385
 size 4984

 version https://git-lfs.github.com/spec/v1
+oid sha256:f2c8e7a60e2d0e02704e415f7f8e4f0f58fa01832d62e0a691607a78a5a2e58a
 size 4984