Mohamed Boghdady committed on
Commit 6a47ae0
1 parent: e07dbd5

Training in progress, step 3000

model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:acfde6812ec52d58edd0f322dd43d7a40cc7c51df248e3a62b6b25ca9cb07b4a
+ oid sha256:2879cb08959ad221b66c21ab4d4a493e4c88388cffc24e734cebf848dc86db1d
  size 305452744
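The file changed above is a Git LFS pointer, not the weights themselves: three `key value` lines giving the spec version, the SHA-256 object id, and the byte size. A minimal sketch of how such a pointer could be parsed and compared follows; the helper names `parse_lfs_pointer` and `content_changed` are hypothetical and exist only for this illustration.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid line has the form "<hash-algo>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def content_changed(old: dict, new: dict) -> bool:
    """The digest identifies the object, so a different digest means different bytes."""
    return old["digest"] != new["digest"]


# Pointer contents copied from the model.safetensors hunk in this commit.
OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:acfde6812ec52d58edd0f322dd43d7a40cc7c51df248e3a62b6b25ca9cb07b4a
size 305452744
"""
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:2879cb08959ad221b66c21ab4d4a493e4c88388cffc24e734cebf848dc86db1d
size 305452744
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)
print("content changed:", content_changed(old, new))
print("size changed:", old["size"] != new["size"])
```

Here the digest differs while the size stays at 305452744 bytes, which is what one would expect when a checkpoint with the same tensor shapes is overwritten with updated values.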
runs/Jul19_15-13-23_8df4908137f9/events.out.tfevents.1721402005.8df4908137f9.35.1 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:278933a2a2abb99e3d8ebe06b0cbb84204f2b50e9e3f9bc049be54c8846071e5
- size 9751
+ oid sha256:fc17085243a662bf817d7b60895d82b3e643bc4691b248a3edea859e4ca73dc2
+ size 10332
wandb/debug-internal.log CHANGED
The diff for this file is too large to render. See raw diff
 
wandb/run-20240719_150308-wq61zns9/files/output.log CHANGED
@@ -7,3 +7,5 @@ Non-default generation parameters: {'max_length': 512, 'num_beams': 4, 'bad_word
  Some non-default generation parameters are set in the model config. These should go into a GenerationConfig file (https://huggingface.co/docs/transformers/generation_strategies#save-a-custom-decoding-strategy-with-your-model) instead. This warning will be raised to an exception in v4.41.
  Non-default generation parameters: {'max_length': 512, 'num_beams': 4, 'bad_words_ids': [[62801]], 'forced_eos_token_id': 0}
  Some non-default generation parameters are set in the model config. These should go into a GenerationConfig file (https://huggingface.co/docs/transformers/generation_strategies#save-a-custom-decoding-strategy-with-your-model) instead. This warning will be raised to an exception in v4.41.
+ Non-default generation parameters: {'max_length': 512, 'num_beams': 4, 'bad_words_ids': [[62801]], 'forced_eos_token_id': 0}
+ Some non-default generation parameters are set in the model config. These should go into a GenerationConfig file (https://huggingface.co/docs/transformers/generation_strategies#save-a-custom-decoding-strategy-with-your-model) instead. This warning will be raised to an exception in v4.41.
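The warning logged above asks for the four listed parameters to live in a separate generation config rather than in the model config. In transformers this is normally done with the `GenerationConfig` class, whose `save_pretrained` method writes a `generation_config.json` file. The standard-library sketch below only illustrates the split itself; the helper `split_generation_params` and the placeholder key `architecture` are hypothetical, and treating exactly these four keys as generation parameters is an assumption taken from the warning text.

```python
import json

# Keys named in the logged warning. Limiting the split to these four is an assumption.
GENERATION_KEYS = ("max_length", "num_beams", "bad_words_ids", "forced_eos_token_id")


def split_generation_params(model_config: dict, keys=GENERATION_KEYS):
    """Return (model config without generation keys, generation-only config)."""
    generation = {k: model_config[k] for k in keys if k in model_config}
    remaining = {k: v for k, v in model_config.items() if k not in generation}
    return remaining, generation


# Values copied from the warning; "architecture" is a hypothetical placeholder
# standing in for whatever else the real model config contains.
config = {
    "architecture": "placeholder",
    "max_length": 512,
    "num_beams": 4,
    "bad_words_ids": [[62801]],
    "forced_eos_token_id": 0,
}

remaining, generation = split_generation_params(config)

# This JSON is what would be stored alongside the model as generation_config.json.
generation_json = json.dumps(generation, indent=2, sort_keys=True)
print(generation_json)
```

Once the generation parameters are stored separately and removed from the model config, the repeated warning at each checkpoint save should no longer appear.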
wandb/run-20240719_150308-wq61zns9/files/wandb-summary.json CHANGED
@@ -1 +1 @@
- {"eval/loss": 0.11479848623275757, "eval/bleu": 21.4563, "eval/gen_len": 67.4739, "eval/runtime": 282.2418, "eval/samples_per_second": 4.418, "eval/steps_per_second": 0.276, "train/epoch": 8.012820512820513, "train/global_step": 2500, "_timestamp": 1721406065.5486827, "_runtime": 4677.009863615036, "_step": 12, "train/loss": 0.0859, "train/grad_norm": 0.31690147519111633, "train/learning_rate": 3.974358974358974e-06}
+ {"eval/loss": 0.11444152891635895, "eval/bleu": 21.8952, "eval/gen_len": 67.3031, "eval/runtime": 280.9076, "eval/samples_per_second": 4.439, "eval/steps_per_second": 0.278, "train/epoch": 9.615384615384615, "train/global_step": 3000, "_timestamp": 1721406708.9238005, "_runtime": 5320.384981393814, "_step": 14, "train/loss": 0.0814, "train/grad_norm": 0.2923108637332916, "train/learning_rate": 7.692307692307694e-07}
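The two summary lines are easier to read as differences between step 2500 and step 3000. The sketch below uses a subset of the logged values to compute those deltas, the number of optimizer steps per epoch, and an extrapolated schedule length. The helper names are hypothetical, and the schedule inference assumes a linear decay to zero with no warmup, which the log does not confirm.

```python
# Subset of the wandb-summary.json values before and after this commit.
STEP_2500 = {
    "eval/loss": 0.11479848623275757, "eval/bleu": 21.4563,
    "train/loss": 0.0859, "train/epoch": 8.012820512820513,
    "train/global_step": 2500, "train/learning_rate": 3.974358974358974e-06,
}
STEP_3000 = {
    "eval/loss": 0.11444152891635895, "eval/bleu": 21.8952,
    "train/loss": 0.0814, "train/epoch": 9.615384615384615,
    "train/global_step": 3000, "train/learning_rate": 7.692307692307694e-07,
}


def metric_deltas(old: dict, new: dict, keys) -> dict:
    """Difference new - old for each requested metric."""
    return {k: new[k] - old[k] for k in keys}


def steps_per_epoch(summary: dict) -> int:
    """Optimizer steps per epoch implied by one summary line."""
    return round(summary["train/global_step"] / summary["train/epoch"])


def infer_linear_schedule(step_a, lr_a, step_b, lr_b):
    """Fit a straight line through two (step, lr) points.

    Assumes linear decay to zero with no warmup; returns
    (step at which lr reaches 0, lr extrapolated back to step 0).
    """
    slope = (lr_b - lr_a) / (step_b - step_a)
    total_steps = step_a - lr_a / slope
    peak_lr = lr_a - slope * step_a
    return total_steps, peak_lr


deltas = metric_deltas(STEP_2500, STEP_3000, ["eval/loss", "eval/bleu", "train/loss"])
total_steps, peak_lr = infer_linear_schedule(
    STEP_2500["train/global_step"], STEP_2500["train/learning_rate"],
    STEP_3000["train/global_step"], STEP_3000["train/learning_rate"],
)

for name, value in deltas.items():
    print(f"{name}: {value:+.4f}")
print("steps per epoch:", steps_per_epoch(STEP_3000))
print("extrapolated total steps:", round(total_steps))
print(f"extrapolated peak lr: {peak_lr:.1e}")
```

Under that assumption the numbers suggest a run of about 3120 steps (10 epochs of 312 steps) starting from a learning rate near 2e-5, so step 3000 is close to the end of training. BLEU rises by about 0.44 while eval loss drops only slightly.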
wandb/run-20240719_150308-wq61zns9/logs/debug-internal.log CHANGED
The diff for this file is too large to render. See raw diff
 
wandb/run-20240719_150308-wq61zns9/run-wq61zns9.wandb CHANGED
Binary files a/wandb/run-20240719_150308-wq61zns9/run-wq61zns9.wandb and b/wandb/run-20240719_150308-wq61zns9/run-wq61zns9.wandb differ