Hyperparams for 7M vs Gen stages?

#1
by alpayariyak - opened

Hi, you mention that you train in two stages but list only one set of hyperparameters. Did you use the same ones for both stages?
