mistral-7b-expert-iteration-iter3 / train_results.json
FrankWu96's picture
Initial commit of model files
38676e4
raw
history blame contribute delete
229 Bytes
{
"epoch": 1.0,
"total_flos": 22194243502080.0,
"train_loss": 0.7179017213155638,
"train_runtime": 1325.9838,
"train_samples": 19996,
"train_samples_per_second": 5.109,
"train_steps_per_second": 0.08
}