Casper0508 committed
Commit: 7234337
Parent(s): 7b9a13f

End of training
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7124
+- Loss: 1.4797
 
 ## Model description
 
@@ -35,7 +35,7 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 5e-05
+- learning_rate: 0.0001
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
@@ -50,31 +50,31 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.8363        | 1.36  | 10   | 3.5698          |
-| 3.2454        | 2.71  | 20   | 2.7356          |
-| 2.2867        | 4.07  | 30   | 1.7205          |
-| 1.4623        | 5.42  | 40   | 1.2840          |
-| 1.1723        | 6.78  | 50   | 1.0982          |
-| 1.0295        | 8.14  | 60   | 0.9766          |
-| 0.9085        | 9.49  | 70   | 0.8723          |
-| 0.784         | 10.85 | 80   | 0.7651          |
-| 0.717         | 12.2  | 90   | 0.7394          |
-| 0.6745        | 13.56 | 100  | 0.7235          |
-| 0.6402        | 14.92 | 110  | 0.7157          |
-| 0.6251        | 16.27 | 120  | 0.7089          |
-| 0.5961        | 17.63 | 130  | 0.7100          |
-| 0.5871        | 18.98 | 140  | 0.7042          |
-| 0.5714        | 20.34 | 150  | 0.7070          |
-| 0.5582        | 21.69 | 160  | 0.7062          |
-| 0.5457        | 23.05 | 170  | 0.7076          |
-| 0.5392        | 24.41 | 180  | 0.7094          |
-| 0.5354        | 25.76 | 190  | 0.7100          |
-| 0.5278        | 27.12 | 200  | 0.7105          |
-| 0.5275        | 28.47 | 210  | 0.7110          |
-| 0.5249        | 29.83 | 220  | 0.7123          |
-| 0.5204        | 31.19 | 230  | 0.7123          |
-| 0.5198        | 32.54 | 240  | 0.7123          |
-| 0.5195        | 33.9  | 250  | 0.7124          |
+| 3.365         | 1.36  | 10   | 2.0638          |
+| 1.3671        | 2.71  | 20   | 0.9814          |
+| 0.817         | 4.07  | 30   | 0.7618          |
+| 0.6648        | 5.42  | 40   | 0.7134          |
+| 0.5897        | 6.78  | 50   | 0.6871          |
+| 0.5076        | 8.14  | 60   | 0.6776          |
+| 0.4545        | 9.49  | 70   | 0.7360          |
+| 0.4059        | 10.85 | 80   | 0.7673          |
+| 0.3544        | 12.2  | 90   | 0.8158          |
+| 0.3161        | 13.56 | 100  | 0.8801          |
+| 0.2844        | 14.92 | 110  | 0.9591          |
+| 0.259         | 16.27 | 120  | 0.9817          |
+| 0.2405        | 17.63 | 130  | 1.0922          |
+| 0.2298        | 18.98 | 140  | 1.1705          |
+| 0.2125        | 20.34 | 150  | 1.1817          |
+| 0.2073        | 21.69 | 160  | 1.2862          |
+| 0.1998        | 23.05 | 170  | 1.3352          |
+| 0.1912        | 24.41 | 180  | 1.3434          |
+| 0.1883        | 25.76 | 190  | 1.4113          |
+| 0.1851        | 27.12 | 200  | 1.4113          |
+| 0.1796        | 28.47 | 210  | 1.4654          |
+| 0.1805        | 29.83 | 220  | 1.4565          |
+| 0.1768        | 31.19 | 230  | 1.4650          |
+| 0.1763        | 32.54 | 240  | 1.4769          |
+| 0.1752        | 33.9  | 250  | 1.4797          |
 
 
 ### Framework versions
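The new run's log table shows a classic overfitting pattern: training loss keeps falling while validation loss bottoms out early (0.6776 at step 60) and then climbs to 1.4797. If one wanted to pick the best checkpoint from such a table rather than the last one, a minimal stdlib sketch (with the first ten `(step, validation_loss)` pairs transcribed from the table above) could look like this:

```python
# Pick the checkpoint with the lowest validation loss from the eval log.
# (step, validation_loss) pairs transcribed from the new run's table above;
# only the first 100 steps are listed, since validation loss only rises after.
eval_log = [
    (10, 2.0638), (20, 0.9814), (30, 0.7618), (40, 0.7134),
    (50, 0.6871), (60, 0.6776), (70, 0.7360), (80, 0.7673),
    (90, 0.8158), (100, 0.8801),
]

best_step, best_loss = min(eval_log, key=lambda pair: pair[1])
print(best_step, best_loss)  # 60 0.6776
```

Loading the step-60 checkpoint (or enabling early stopping / `load_best_model_at_end` in the trainer) would likely serve better than the final weights this commit reports.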
adapter_config.json CHANGED
@@ -7,11 +7,11 @@
   "init_lora_weights": true,
   "layers_pattern": null,
   "layers_to_transform": null,
-  "lora_alpha": 32,
-  "lora_dropout": 0.3,
+  "lora_alpha": 128,
+  "lora_dropout": 0.1,
   "modules_to_save": null,
   "peft_type": "LORA",
-  "r": 16,
+  "r": 64,
   "revision": null,
   "target_modules": [
     "q_proj",
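One detail worth noting in the diff above: both the old and the new config keep `lora_alpha / r` equal to 2, so the effective LoRA scaling of the adapter update is unchanged; only the adapter's rank (capacity) and dropout changed. A minimal sketch verifying that, using the values from the diff:

```python
# LoRA multiplies the low-rank update by lora_alpha / r.
# Values copied from the old and new adapter_config.json in the diff above.
old = {"r": 16, "lora_alpha": 32, "lora_dropout": 0.3}
new = {"r": 64, "lora_alpha": 128, "lora_dropout": 0.1}

for name, cfg in (("old", old), ("new", new)):
    scaling = cfg["lora_alpha"] / cfg["r"]
    print(name, scaling)  # both configs give a scaling of 2.0
```

Keeping the scaling fixed while raising `r` is a common way to grow adapter capacity without changing the magnitude of the update at initialization.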
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:853047c7ee6f98c051a455fe53ed043a81d61b9f38d831e7f1d882e9b2d0c0a8
-size 37774528
+oid sha256:7baefad0749b95301d1ef5729beb829eccb0fff2ef98c4b1dfe752fecdb4b7cf
+size 151020944
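The adapter file grows roughly fourfold, which is consistent with the rank change: LoRA adds `r * (d_in + d_out)` parameters per adapted weight matrix, i.e. linear in `r`, and `r` went from 16 to 64. A quick sanity check on the two LFS pointer sizes above:

```python
# Adapter checkpoint sizes (bytes) from the old and new LFS pointers above.
old_size = 37_774_528
new_size = 151_020_944

# LoRA parameter count is linear in r, and r was quadrupled (16 -> 64),
# so the file size should grow by roughly 4x (plus small header overhead).
ratio = new_size / old_size
print(round(ratio, 3))  # ~4.0, matching r: 16 -> 64
```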
emissions.csv CHANGED
@@ -1,2 +1,2 @@
 timestamp,experiment_id,project_name,duration,emissions,energy_consumed,country_name,country_iso_code,region,on_cloud,cloud_provider,cloud_region
-2024-07-25T00:29:17,b81b783c-301a-439a-a0a5-4917c65bc6de,codecarbon,6290.977914571762,0.3557443410838277,0.5292955629092988,United Kingdom,GBR,scotland,N,,
+2024-07-25T20:07:05,71cb2d14-a3e9-44f2-9adf-aa99d60af3f0,codecarbon,6487.100156784058,0.3647758733647877,0.5427331623606918,United Kingdom,GBR,scotland,N,,
runs/Jul25_18-18-54_msc-modeltrain-pod/events.out.tfevents.1721931538.msc-modeltrain-pod.1693.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:47731826fc06887da5057a09bf903796b3bbbc0d7f41bc931c955955d7dffb8e
+size 17035
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:10b3b3a3d7323b4bda4c1a482867d25717c65236d1bd44bb96cd5c9ce33dd107
+oid sha256:fa99f7e0c9cee069a0ee479ad9d6186ca6da7c27642e43b0a7cf82a3fc09d7e6
 size 4984