Why the model is so large?

#1
by JohnLaw - opened

a 2b model in gguf format is 10Gb?

Google org

That's right! The weights are store in fp32 precision (see here) meaning 2.6B params * 4 bytes per param = ~10GB.
image.png

Sign up or log in to comment