I tried with and without vllm; "RuntimeError: probability tensor contains either `inf`, `nan` or element < 0" still occurs during inference with transformers!
1
#2 opened 4 months ago by Zaiping
use_exllama?
#1 opened 5 months ago by DDDSSS