OpenAssistant/oasst-sft-1-pythia-12b · Does not generate text

Apr 7, 2023

All the time I use the model, it generates only <|endoftext|>. How can I fix it?

from transformers import AutoModelForCausalLM, AutoModelForSeq2SeqLM, GPTNeoXForCausalLM, AutoTokenizer

checkpoint = "OpenAssistant/oasst-sft-1-pythia-12b"

tokenizer = AutoTokenizer.from_pretrained(checkpoint, cache_dir='models_hf')
model = AutoModelForCausalLM.from_pretrained(checkpoint, device_map="auto", load_in_8bit=True, cache_dir='models_hf')

message = "Hello, I am"
inp = "<|prompter|>"+message+"<|endoftext|><|assistant|>"
data = tokenizer([inp], return_tensors="pt")
data = {k: v.to(model.device) for k, v in data.items() if k in ("input_ids", "attention_mask")}
outputs = model.generate(**data)
print(tokenizer.decode(outputs[0]))

<|prompter|>Hello, I am<|endoftext|><|assistant|><|endoftext|>

hanifabdlh

Apr 12, 2023

same issues, i use example code from here : https://huggingface.co/EleutherAI/pythia-12b-deduped

yeshel

Apr 18, 2023

Having the same issue.

My two cents -
One cent: If I run a forward pass, the logits are all nan
Two cents: Running the cpu version works fine

yeshel

Apr 18, 2023

Three cents: works well without load_in_8bit=True. (using 2 gpus in my case)

davidhung

May 30, 2023

I have the same problem. It doens't work with 8 bit loader