Difference between bf16 and int8

#4
by TheBigBlockPC - opened

How large is the difference between the bf16 and the int8 quantization of the model? Does it impact the understanding of the prompt or add any artifacts that wouldn't be in bf16?

TheBigBlockPC changed discussion title from Difference between bf16 abd int8 to Difference between bf16 and int8

@TheBigBlockPC Int8 should be slightly lower quality and will produce slightly different results, but nothing too major.
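
For a rough feel of why int8 is expected to be only slightly worse, here is a minimal sketch that compares the rounding error of bf16 against a simple int8 round trip on made-up weights. Everything in it is an assumption for illustration: the Gaussian weights, the helper names (`to_bf16`, `int8_roundtrip`, `mean_abs_error`), and the symmetric per-tensor int8 scheme, which may not match how this model's int8 checkpoint was actually produced. It measures numerical error only, not video quality or prompt understanding.

```python
import random
import struct


def to_bf16(x: float) -> float:
    """Round a float to the nearest bfloat16 value (round-to-nearest-even)."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # bf16 keeps the top 16 bits of the float32 pattern; round the dropped half.
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)
    bits = (((bits + rounding_bias) >> 16) << 16) & 0xFFFFFFFF
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def int8_roundtrip(values):
    """Symmetric per-tensor int8 quantization followed by dequantization.

    Assumed scheme for illustration; real int8 checkpoints often use
    per-channel or per-block scales instead.
    """
    scale = max(abs(v) for v in values) / 127.0 or 1.0
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    return [q * scale for q in quantized]


def mean_abs_error(reference, approx):
    """Average absolute difference between two equally long sequences."""
    return sum(abs(r - a) for r, a in zip(reference, approx)) / len(reference)


# Hypothetical stand-in for a weight tensor: small zero-mean Gaussian values.
random.seed(0)
weights = [random.gauss(0.0, 0.02) for _ in range(10_000)]

bf16_error = mean_abs_error(weights, [to_bf16(w) for w in weights])
int8_error = mean_abs_error(weights, int8_roundtrip(weights))

print(f"bf16 mean abs error: {bf16_error:.2e}")
print(f"int8 mean abs error: {int8_error:.2e}")
```

On weights like these, the int8 round trip should show a larger error than bf16, because a single scale has to cover the whole value range, while bf16 keeps a per-value exponent. How much of that error is visible in generated output depends on the model and on the quantization scheme actually used.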

Could you please provide a video generated with the bf16 and int8 quantizations (if possible) and name some differences in quality and prompt understanding? Also, how large is the speed difference between bf16 and int8?
