hidden_states = hidden_states.transpose(1, 2).reshape(batch_size, -1, attn.heads * head_dim) 1259 hidden_states = hidden_states.to(query.dtype)
RuntimeError: cutlassF: no kernel found to launch!
· Sign up or log in to comment