I download the files and tried to load them locally on HPC. It seems like there are nan numbers in model parameters. This happened for both 40b and 40b-instruct
Did not have this problem for 7b models.
· Sign up or log in to comment