llama-3-86-lora-pretrain_v2 / train_results.json
ytcheng's picture
End of training
9653e8f verified
{
"epoch": 2.9964796996010326,
"total_flos": 1.1780062420402176e+18,
"train_loss": 2.428264606566656,
"train_runtime": 11538.2228,
"train_samples_per_second": 2.216,
"train_steps_per_second": 0.138
}