I will probably be working on decontaminating the base Yi-34B model now. \
My second run of the AEZAKMI v2 fine-tune was just 0.15 epochs, and I really like how natural this model is and how rich its vocabulary is. I will try to train for less time to hit the sweet spot. \
I will be uploading the LoRA adapter for that second run that was just 0.15 epochs. \
I believe I might have gotten what I want if I had stopped training sooner. I don't have checkpoints from more than 1500 steps back, so I would need to re-run training to get it back.