color_descriptions

Files changed (3)

README.md +26 -1
adapter_model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -17,6 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
 # results
 This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T](https://huggingface.co/TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T) on an unknown dataset.
 ## Model description
@@ -44,10 +46,33 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.03
-- training_steps: 200
 ### Training results
 ### Framework versions

 # results
 This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T](https://huggingface.co/TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.5881
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.03
+- num_epochs: 2
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 0.6511        | 0.0948 | 50   | 0.7913          |
+| 0.5479        | 0.1896 | 100  | 0.7001          |
+| 0.5125        | 0.2844 | 150  | 0.6768          |
+| 0.4974        | 0.3791 | 200  | 0.6564          |
+| 0.4947        | 0.4739 | 250  | 0.6490          |
+| 0.4802        | 0.5687 | 300  | 0.6383          |
+| 0.4762        | 0.6635 | 350  | 0.6289          |
+| 0.4678        | 0.7583 | 400  | 0.6233          |
+| 0.4742        | 0.8531 | 450  | 0.6157          |
+| 0.4633        | 0.9479 | 500  | 0.6127          |
+| 0.6096        | 1.0427 | 550  | 0.6027          |
+| 0.6137        | 1.1374 | 600  | 0.5986          |
+| 0.6163        | 1.2322 | 650  | 0.5963          |
+| 0.6078        | 1.3270 | 700  | 0.5943          |
+| 0.6019        | 1.4218 | 750  | 0.5921          |
+| 0.615         | 1.5166 | 800  | 0.5906          |
+| 0.6061        | 1.6114 | 850  | 0.5897          |
+| 0.6106        | 1.7062 | 900  | 0.5890          |
+| 0.6027        | 1.8009 | 950  | 0.5886          |
+| 0.6094        | 1.8957 | 1000 | 0.5883          |
+| 0.5261        | 1.9905 | 1050 | 0.5881          |
 ### Framework versions
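
As a sanity check on the results table added in this diff, the Epoch and Step columns can be used to estimate how many optimizer steps make up one epoch, and the Validation Loss column can be checked for a steady decrease. The following is a minimal sketch: the names `rows`, `ratios`, and `strictly_decreasing` are illustrative, and the values are copied by hand from the table rather than read from any training log.

```python
# (step, epoch, validation_loss) copied from the README table in this diff.
rows = [
    (50, 0.0948, 0.7913), (100, 0.1896, 0.7001), (150, 0.2844, 0.6768),
    (200, 0.3791, 0.6564), (250, 0.4739, 0.6490), (300, 0.5687, 0.6383),
    (350, 0.6635, 0.6289), (400, 0.7583, 0.6233), (450, 0.8531, 0.6157),
    (500, 0.9479, 0.6127), (550, 1.0427, 0.6027), (600, 1.1374, 0.5986),
    (650, 1.2322, 0.5963), (700, 1.3270, 0.5943), (750, 1.4218, 0.5921),
    (800, 1.5166, 0.5906), (850, 1.6114, 0.5897), (900, 1.7062, 0.5890),
    (950, 1.8009, 0.5886), (1000, 1.8957, 0.5883), (1050, 1.9905, 0.5881),
]

# Steps per epoch implied by each row (step / fractional epoch).
ratios = [step / epoch for step, epoch, _ in rows]

# True if every evaluation improved on the previous one.
val_losses = [loss for _, _, loss in rows]
strictly_decreasing = all(a > b for a, b in zip(val_losses, val_losses[1:]))

# Row with the lowest validation loss, as (step, loss).
best_step, _, best_loss = min(rows, key=lambda r: r[2])

print(f"steps per epoch: {min(ratios):.1f} to {max(ratios):.1f}")
print(f"validation loss strictly decreasing: {strictly_decreasing}")
print(f"best validation loss {best_loss} at step {best_step}")
```

Every row implies between 527 and 528 steps per epoch, so `num_epochs: 2` corresponds to roughly 1055 steps. That is consistent with the last logged evaluation at step 1050 and with the reported evaluation loss of 0.5881.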

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6ca95ca1697f78ca1ce3b6f28c30437f47cd8e831bbf7d3959bfedb03ef10003
 size 57701064

 version https://git-lfs.github.com/spec/v1
+oid sha256:1a2f716d20045f5a32c6b9a608f618466fe254493721f18005d2f1dd889029eb
 size 57701064

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:605dffa90d76fc6c7a4c5da03f1bf2202588f4135c66355a9c1c1597078f319e
 size 5368

 version https://git-lfs.github.com/spec/v1
+oid sha256:d6d9c3d4237312a4bba1a2b8b3197b72e01c812c239c98b08f19e97a2f942d4e
 size 5368
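
The two binary files above are stored as Git LFS pointer files, each with a `version`, an `oid`, and a `size` line. As a rough illustration of how to read such a diff, the sketch below parses the old and new pointers for `adapter_model.safetensors` and compares them. `parse_lfs_pointer` is a hypothetical helper written for this example, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file (illustrative helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Old and new pointers for adapter_model.safetensors, copied from the diff.
old_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:6ca95ca1697f78ca1ce3b6f28c30437f47cd8e831bbf7d3959bfedb03ef10003
size 57701064
""")
new_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:1a2f716d20045f5a32c6b9a608f618466fe254493721f18005d2f1dd889029eb
size 57701064
""")

content_changed = old_pointer["digest"] != new_pointer["digest"]
size_changed = old_pointer["size"] != new_pointer["size"]
print(f"content changed: {content_changed}, size changed: {size_changed}")
```

The digest differs while the byte size stays the same. This is what one would expect if the adapter weights were retrained with an unchanged adapter shape, although the diff alone does not confirm that.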