Difference between bf16 and int8
#4
by
TheBigBlockPC
- opened
How high is the difference between the bf16 abd the int8 quantization of the model. Dies it impact the understanding of the prompt or add any artifacts that wouldn't be in bf16
TheBigBlockPC
changed discussion title from
Difference between bf16 abd int8
to Difference between bf16 and int8
@TheBigBlockPC Int8 should be slightly lower quality and will produce slightly different results, but nothing too major.
Could you please provide a video generated with the bf16 abd int8 quantization (if possible) and name some differences in quality and prompt understanding. how extreme is speed difference between bf16 and int8