harpomaxx
/

opt350m-codealpaca20k

Model card Files Files and versions Community

harpomaxx commited on Sep 24, 2023

Commit

d618791

•

1 Parent(s): 7562039

Update README.md

Files changed (1) hide show

README.md +22 -1

README.md CHANGED Viewed

@@ -44,7 +44,6 @@ Script used for training is avaiable [here](https://github.com/harpomaxx/llm-fin
 ### Training Arguments:
-- **Output Directory**: `./results`
 - **Batch Size**: 4 (per device)
 - **Gradient Accumulation Steps**: 2
 - **Number of Epochs**: 10
@@ -57,6 +56,28 @@ Script used for training is avaiable [here](https://github.com/harpomaxx/llm-fin
 - **Save Steps**: 250
 - **FP16 Precision**: Enabled
 ## Usage
 ```python

 ### Training Arguments:
 - **Batch Size**: 4 (per device)
 - **Gradient Accumulation Steps**: 2
 - **Number of Epochs**: 10
 - **Save Steps**: 250
 - **FP16 Precision**: Enabled
+### Training information from wandb
+- **train/total_flos:** 72,761,086,854,758,400
+- **train/train_loss:** 1.5557164267259171
+- **train/train_runtime:** 5892.7285 seconds
+- **train/train_steps_per_second:** 4.248
+- **_runtime:** 5891.33976650238 seconds
+- **_timestamp:** 1,695,390,058.0198596
+- **train/epoch:** 10
+- **train/global_step:** 25,030
+- **train/learning_rate:** 8.371592860045851e-12
+- **train/train_samples_per_second:** 33.977
+- **_step:** 2,503
+- **_wandb.runtime:** 5890 seconds
+- **train/loss:** 1.4114
 ## Usage
 ```python