gpt-j-6B-tensorrt-int8 / gptj-i8.onnx

Commit History

added onnx model (fake quant) compatible with trt
554833e

igor commited on