Error Loading The Model

#3
by jonathanjordan21 - opened

Code :

from mamba_ssm.models.mixer_seq_simple import MambaLMHeadModel
model = MambaLMHeadModel.from_pretrained("Zyphra/Mamba-370M", device="cuda")

Error Statement :

RuntimeError: Error(s) in loading state_dict for MambaLMHeadModel:
    Unexpected key(s) in state_dict: "backbone.norm_f.bias". 

This sounds like a configuration issue. Please, take a look at this: https://huggingface.co/Zyphra/Mamba-370M/discussions/2

Sign up or log in to comment