wrong context length??

#6
by AaronFeng753 - opened

in https://huggingface.co/VAGOsolutions/SauerkrautLM-Nemo-12b-Instruct/blob/main/config.json:

"max_position_embeddings": 1024000,

that's 1024K, the original nemo is 128K

are you sure this 1024k is right?

VAGO solutions org

Just checked the mistral Nemo version. It is the same context length there.

Best regards

DavidGF changed discussion status to closed

Sign up or log in to comment