avramesh/ft-attempt2

Files changed (5) hide show

README.md CHANGED Viewed

@@ -14,9 +14,9 @@ should probably proofread and complete it, then remove this comment. -->
 # shawgpt-ft
-This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.3938
 ## Model description
@@ -51,16 +51,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 4.0211        | 0.9231 | 3    | 3.4617          |
-| 3.8387        | 1.8462 | 6    | 3.2827          |
-| 3.6289        | 2.7692 | 9    | 3.1373          |
-| 2.5801        | 4.0    | 13   | 2.9469          |
-| 3.2557        | 4.9231 | 16   | 2.7959          |
-| 3.0471        | 5.8462 | 19   | 2.6640          |
-| 2.8925        | 6.7692 | 22   | 2.5527          |
-| 2.0712        | 8.0    | 26   | 2.4415          |
-| 2.6765        | 8.9231 | 29   | 2.3988          |
-| 1.8573        | 9.2308 | 30   | 2.3938          |
 ### Framework versions

 # shawgpt-ft
+This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.1847
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 1.2637        | 0.9992 | 652  | 1.1808          |
+| 1.142         | 2.0    | 1305 | 1.1452          |
+| 1.0811        | 2.9992 | 1957 | 1.1300          |
+| 1.0268        | 4.0    | 2610 | 1.1262          |
+| 0.9815        | 4.9992 | 3262 | 1.1269          |
+| 0.9389        | 6.0    | 3915 | 1.1323          |
+| 0.9061        | 6.9992 | 4567 | 1.1498          |
+| 0.8749        | 8.0    | 5220 | 1.1575          |
+| 0.8523        | 8.9992 | 5872 | 1.1676          |
+| 0.8351        | 9.9923 | 6520 | 1.1847          |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b7ae5cf5a82d62af8b2822c2cf362ede59148c2f98ddc68bc4febdd74c5e3418
 size 8397056

 version https://git-lfs.github.com/spec/v1
+oid sha256:642d3d87072ee738e1f805c0778ad74f6846407c98f1677c829debb6590d9191
 size 8397056

runs/Jul05_19-36-49_palomino/events.out.tfevents.1720208210.palomino.768072.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:daff43215180ccd0f4b96574cbaa177187193e9d25ac27d5cfded5ba7f38054b
+size 4905

runs/Jul05_19-39-22_palomino/events.out.tfevents.1720208362.palomino.768548.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:002e9430e9e0c5af8abc899ad5c5d3f639e521c2cbc75999cf2fe0b46c73665e
+size 10079

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a55c8d6d0f17977eddfca857e5504b37710103b8264a8d44bd0c3bbb85a12eb3
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:03014ae9ef0d63e2323c58e6e7c234a4e7f182cd952461c53e553a1fd246ad8f
 size 5112