bilkultheek commited on
Commit
1370ce5
1 Parent(s): 6e55c47

Training in progress, step 20

Browse files
Files changed (36) hide show
  1. README.md +9 -15
  2. adapter_config.json +0 -4
  3. adapter_model.safetensors +1 -1
  4. runs/Aug20_23-57-27_fastgpuserv/events.out.tfevents.1724180258.fastgpuserv.1412094.0 +3 -0
  5. runs/Aug20_23-57-27_fastgpuserv/events.out.tfevents.1724193656.fastgpuserv.1412094.1 +3 -0
  6. runs/Aug25_12-49-36_fastgpuserv/events.out.tfevents.1724572185.fastgpuserv.1980100.0 +3 -0
  7. runs/Aug25_12-51-29_fastgpuserv/events.out.tfevents.1724572295.fastgpuserv.1980100.1 +3 -0
  8. runs/Aug25_12-52-57_fastgpuserv/events.out.tfevents.1724572380.fastgpuserv.1980100.2 +3 -0
  9. runs/Aug25_12-54-00_fastgpuserv/events.out.tfevents.1724572443.fastgpuserv.1980100.3 +3 -0
  10. runs/Aug25_12-56-16_fastgpuserv/events.out.tfevents.1724572582.fastgpuserv.2002935.0 +3 -0
  11. runs/Aug25_12-58-05_fastgpuserv/events.out.tfevents.1724572692.fastgpuserv.2002935.1 +3 -0
  12. runs/Aug25_13-01-05_fastgpuserv/events.out.tfevents.1724572873.fastgpuserv.2014604.0 +3 -0
  13. runs/Aug25_13-01-05_fastgpuserv/events.out.tfevents.1724572937.fastgpuserv.2014604.1 +3 -0
  14. runs/Aug25_13-02-49_fastgpuserv/events.out.tfevents.1724572977.fastgpuserv.2014604.2 +3 -0
  15. runs/Aug25_13-08-55_fastgpuserv/events.out.tfevents.1724573339.fastgpuserv.2014604.3 +3 -0
  16. runs/Aug25_13-11-19_fastgpuserv/events.out.tfevents.1724573484.fastgpuserv.2039446.0 +3 -0
  17. runs/Aug25_13-13-03_fastgpuserv/events.out.tfevents.1724573587.fastgpuserv.2039446.1 +3 -0
  18. runs/Aug25_13-14-36_fastgpuserv/events.out.tfevents.1724573679.fastgpuserv.2039446.2 +3 -0
  19. runs/Aug25_13-16-49_fastgpuserv/events.out.tfevents.1724573812.fastgpuserv.2039446.3 +3 -0
  20. runs/Aug25_13-18-04_fastgpuserv/events.out.tfevents.1724573887.fastgpuserv.2039446.4 +3 -0
  21. runs/Aug25_13-19-17_fastgpuserv/events.out.tfevents.1724573959.fastgpuserv.2039446.5 +3 -0
  22. runs/Aug25_13-21-16_fastgpuserv/events.out.tfevents.1724574078.fastgpuserv.2039446.6 +3 -0
  23. runs/Aug25_13-22-31_fastgpuserv/events.out.tfevents.1724574155.fastgpuserv.2039446.7 +3 -0
  24. runs/Aug25_13-24-04_fastgpuserv/events.out.tfevents.1724574247.fastgpuserv.2039446.8 +3 -0
  25. runs/Aug25_13-25-56_fastgpuserv/events.out.tfevents.1724574359.fastgpuserv.2039446.9 +3 -0
  26. runs/Aug25_13-28-50_fastgpuserv/events.out.tfevents.1724574532.fastgpuserv.2039446.10 +3 -0
  27. runs/Aug25_13-29-54_fastgpuserv/events.out.tfevents.1724574596.fastgpuserv.2039446.11 +3 -0
  28. runs/Aug25_13-31-12_fastgpuserv/events.out.tfevents.1724574675.fastgpuserv.2039446.12 +3 -0
  29. runs/Aug25_13-33-47_fastgpuserv/events.out.tfevents.1724574833.fastgpuserv.2094483.0 +3 -0
  30. runs/Aug25_13-37-11_fastgpuserv/events.out.tfevents.1724575034.fastgpuserv.2094483.1 +3 -0
  31. runs/Aug25_13-39-10_fastgpuserv/events.out.tfevents.1724575153.fastgpuserv.2094483.2 +3 -0
  32. runs/Aug25_13-40-19_fastgpuserv/events.out.tfevents.1724575222.fastgpuserv.2094483.3 +3 -0
  33. runs/Aug25_13-45-34_fastgpuserv/events.out.tfevents.1724575537.fastgpuserv.2094483.4 +3 -0
  34. runs/Aug25_13-47-07_fastgpuserv/events.out.tfevents.1724575630.fastgpuserv.2094483.5 +3 -0
  35. runs/Aug26_10-52-41_fastgpuserv/events.out.tfevents.1724651569.fastgpuserv.681719.0 +3 -0
  36. training_args.bin +2 -2
README.md CHANGED
@@ -6,18 +6,23 @@ tags:
6
  - sft
7
  - generated_from_trainer
8
  model-index:
9
- - name: Cold-Data-LLama-2-7B
10
  results: []
11
  ---
12
 
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
  should probably proofread and complete it, then remove this comment. -->
15
 
16
- # Cold-Data-LLama-2-7B
17
 
18
  This model is a fine-tuned version of [NousResearch/Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 0.0526
 
 
 
 
 
21
 
22
  ## Model description
23
 
@@ -36,7 +41,7 @@ More information needed
36
  ### Training hyperparameters
37
 
38
  The following hyperparameters were used during training:
39
- - learning_rate: 0.002
40
  - train_batch_size: 16
41
  - eval_batch_size: 32
42
  - seed: 42
@@ -47,17 +52,6 @@ The following hyperparameters were used during training:
47
  - lr_scheduler_warmup_ratio: 0.03
48
  - num_epochs: 10
49
 
50
- ### Training results
51
-
52
- | Training Loss | Epoch | Step | Validation Loss |
53
- |:-------------:|:-----:|:----:|:---------------:|
54
- | 0.1019 | 1.992 | 249 | 0.1022 |
55
- | 0.0542 | 3.984 | 498 | 0.0540 |
56
- | 0.0508 | 5.976 | 747 | 0.0513 |
57
- | 0.0479 | 7.968 | 996 | 0.0515 |
58
- | 0.0472 | 9.96 | 1245 | 0.0537 |
59
-
60
-
61
  ### Framework versions
62
 
63
  - PEFT 0.12.0
 
6
  - sft
7
  - generated_from_trainer
8
  model-index:
9
+ - name: Cold-Again-LLama-2-7B
10
  results: []
11
  ---
12
 
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
  should probably proofread and complete it, then remove this comment. -->
15
 
16
+ # Cold-Again-LLama-2-7B
17
 
18
  This model is a fine-tuned version of [NousResearch/Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - eval_loss: 1.3661
21
+ - eval_runtime: 90.0594
22
+ - eval_samples_per_second: 1.11
23
+ - eval_steps_per_second: 0.044
24
+ - epoch: 5.76
25
+ - step: 36
26
 
27
  ## Model description
28
 
 
41
  ### Training hyperparameters
42
 
43
  The following hyperparameters were used during training:
44
+ - learning_rate: 0.0001
45
  - train_batch_size: 16
46
  - eval_batch_size: 32
47
  - seed: 42
 
52
  - lr_scheduler_warmup_ratio: 0.03
53
  - num_epochs: 10
54
 
 
 
 
 
 
 
 
 
 
 
 
55
  ### Framework versions
56
 
57
  - PEFT 0.12.0
adapter_config.json CHANGED
@@ -15,10 +15,6 @@
15
  "megatron_config": null,
16
  "megatron_core": "megatron.core",
17
  "modules_to_save": [
18
- "classifier",
19
- "score",
20
- "classifier",
21
- "score",
22
  "classifier",
23
  "score"
24
  ],
 
15
  "megatron_config": null,
16
  "megatron_core": "megatron.core",
17
  "modules_to_save": [
 
 
 
 
18
  "classifier",
19
  "score"
20
  ],
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a30675c3031820143a09a8ecc574d5f76914025f376b8d6aa708f125f8d18014
3
  size 134267920
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fd50fb82d204807b97190a86a0bdbeeda83e66d8cdb296e8122ccbecb3f803e8
3
  size 134267920
runs/Aug20_23-57-27_fastgpuserv/events.out.tfevents.1724180258.fastgpuserv.1412094.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cfea3dd99266f8e2d927b3263ad7f9dc2333b546e001b052feec3f12f408c84a
3
+ size 8700
runs/Aug20_23-57-27_fastgpuserv/events.out.tfevents.1724193656.fastgpuserv.1412094.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0dd21beb2705c281008c444c908b3bbc99f8b40a7608ab8cc9afd0c2307affc3
3
+ size 7406
runs/Aug25_12-49-36_fastgpuserv/events.out.tfevents.1724572185.fastgpuserv.1980100.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:94bac5057816ac7b9bd02bccaff1c89290da5dabbdcfe0062596415e54e5a5db
3
+ size 4438
runs/Aug25_12-51-29_fastgpuserv/events.out.tfevents.1724572295.fastgpuserv.1980100.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e94d3717138def6bc618ef8e5c929b48ecefd83aa5b2c8ff342374948ae39f21
3
+ size 5598
runs/Aug25_12-52-57_fastgpuserv/events.out.tfevents.1724572380.fastgpuserv.1980100.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6b97f604468f1ef7f28857b5d74e6bdf9d077c084d5a6e9fb19ae42999dd2b93
3
+ size 5598
runs/Aug25_12-54-00_fastgpuserv/events.out.tfevents.1724572443.fastgpuserv.1980100.3 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c0a91abd9d44e497de4b388cfcd3eeb161700f5a90dbbf797992e4060d85fd0d
3
+ size 4184
runs/Aug25_12-56-16_fastgpuserv/events.out.tfevents.1724572582.fastgpuserv.2002935.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c262987c6874977559b16bc9f9cf632c92511fbfe2ab81569ac9fa6aeba34209
3
+ size 5598
runs/Aug25_12-58-05_fastgpuserv/events.out.tfevents.1724572692.fastgpuserv.2002935.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:430307b7d4b3dfc7f07960a800f4f6c8c3b49704d19a14f0096c6793f57039fa
3
+ size 5598
runs/Aug25_13-01-05_fastgpuserv/events.out.tfevents.1724572873.fastgpuserv.2014604.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:41393a92903e62df14bf495a4039eccc18ea5204ffaedd62b705fddd9e69a0cf
3
+ size 5598
runs/Aug25_13-01-05_fastgpuserv/events.out.tfevents.1724572937.fastgpuserv.2014604.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1b46e86d24bca382788d36b178fdc98484f5f5f25f0b80682f3a67b5774787d4
3
+ size 5598
runs/Aug25_13-02-49_fastgpuserv/events.out.tfevents.1724572977.fastgpuserv.2014604.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a2296f480f72760cadb8ccec85b048b59941e2b707bd6c0a8d65e52b12441a75
3
+ size 5596
runs/Aug25_13-08-55_fastgpuserv/events.out.tfevents.1724573339.fastgpuserv.2014604.3 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1cacbe5d9ef832b1f20887456d37fadc641d787b68741c1c309061df6ad9054f
3
+ size 4184
runs/Aug25_13-11-19_fastgpuserv/events.out.tfevents.1724573484.fastgpuserv.2039446.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:339e20ee57f60c47413524e27c5cd52de457b6f8de8fb8edc3ad397528d5f3a2
3
+ size 6076
runs/Aug25_13-13-03_fastgpuserv/events.out.tfevents.1724573587.fastgpuserv.2039446.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:64b089d6bda5d19a795ebf1e10cdf2b6c9a42af74d8a209fa080f251c30e2e11
3
+ size 6076
runs/Aug25_13-14-36_fastgpuserv/events.out.tfevents.1724573679.fastgpuserv.2039446.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e205c713fd6766b63147f4c86f273f6e443c1cae50def3f1d6978665d62a3821
3
+ size 6076
runs/Aug25_13-16-49_fastgpuserv/events.out.tfevents.1724573812.fastgpuserv.2039446.3 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:defe742e00d150c4f7cd16f19c449d8551e0850f8f033e36814c13839328d9e1
3
+ size 6076
runs/Aug25_13-18-04_fastgpuserv/events.out.tfevents.1724573887.fastgpuserv.2039446.4 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6da021f154c2c8db83797b311ec472783efae80f41409ac8e5c04836c4c352b4
3
+ size 6076
runs/Aug25_13-19-17_fastgpuserv/events.out.tfevents.1724573959.fastgpuserv.2039446.5 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:787ebf30bff7b63bddb73ee484ab8a584ced63199a259e3efea9960c71c7813c
3
+ size 6076
runs/Aug25_13-21-16_fastgpuserv/events.out.tfevents.1724574078.fastgpuserv.2039446.6 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:71f035a87b1d3f3f0e91533cdc4b67555f0f3e2e1e8d969b192fa50d7ff3cdf4
3
+ size 6076
runs/Aug25_13-22-31_fastgpuserv/events.out.tfevents.1724574155.fastgpuserv.2039446.7 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:19f27422f32a5b79f46db60c6cd1053643e2908e01c11939a858d9bfd2c29f37
3
+ size 6076
runs/Aug25_13-24-04_fastgpuserv/events.out.tfevents.1724574247.fastgpuserv.2039446.8 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0c02bfebe7df0f3c9c72a1cca27188e32cfbebbfe151483aec81ee725a09024d
3
+ size 6076
runs/Aug25_13-25-56_fastgpuserv/events.out.tfevents.1724574359.fastgpuserv.2039446.9 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:07dcb0dd548a5dbf42d7374ddee73f563c67cef4bfebcebb968c4de62ba37666
3
+ size 6077
runs/Aug25_13-28-50_fastgpuserv/events.out.tfevents.1724574532.fastgpuserv.2039446.10 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:37bf42ac45b1be45a9462485e3ac1f766c670721d4149072b2ee580063dbf0ca
3
+ size 6077
runs/Aug25_13-29-54_fastgpuserv/events.out.tfevents.1724574596.fastgpuserv.2039446.11 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:baaa5b103f99bf013c76c4c2314bad677518e5133ec10a662a84a9a258e535eb
3
+ size 6077
runs/Aug25_13-31-12_fastgpuserv/events.out.tfevents.1724574675.fastgpuserv.2039446.12 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4084bb131681fdb9652839f5a026b30e1810b0c54e3a6f24f02fed43c7244107
3
+ size 4184
runs/Aug25_13-33-47_fastgpuserv/events.out.tfevents.1724574833.fastgpuserv.2094483.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7b37216c58bec635d1dfb11687cd46222b42efd07b4c3ff2204a950a3fc05c96
3
+ size 6077
runs/Aug25_13-37-11_fastgpuserv/events.out.tfevents.1724575034.fastgpuserv.2094483.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:23cecba266d80cb784f471afcfa3c8022a76a8e0f206d63fec76efcfe2f4777c
3
+ size 6077
runs/Aug25_13-39-10_fastgpuserv/events.out.tfevents.1724575153.fastgpuserv.2094483.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b9810a57917dc5deacee5f3a38e7dc7a26e0c5efae0b5ad82e00e6954d7fe990
3
+ size 6077
runs/Aug25_13-40-19_fastgpuserv/events.out.tfevents.1724575222.fastgpuserv.2094483.3 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:65a4f4b45c2ff2131a135c230eaf1a0e67f873fa5b1c06ee1eb73e38f03323d1
3
+ size 6077
runs/Aug25_13-45-34_fastgpuserv/events.out.tfevents.1724575537.fastgpuserv.2094483.4 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cad21eb3a2a322b320b97ec5258fd775edd1542772f0d62d6e4c4cc52d10fef5
3
+ size 6077
runs/Aug25_13-47-07_fastgpuserv/events.out.tfevents.1724575630.fastgpuserv.2094483.5 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d6d613233e81b43d3dfd5bfcd2e1487f2e7c7ee0fc35873bf13644d53a1aafba
3
+ size 4436
runs/Aug26_10-52-41_fastgpuserv/events.out.tfevents.1724651569.fastgpuserv.681719.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0d6bb75db714a8b97962615252d95be365b7b9a6b67e4e011f09a0bdecf02670
3
+ size 6501
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fcca89a273cd46cfdaeef7e5c3e4ebc823dc716ea92997748a620eae8c3fbd38
3
- size 5368
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:aa87077c98cf82a25c6e6ca0ea31291d12cf2d89cf8409479fa0f6dfcde184fd
3
+ size 5432