Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 20 days ago • 462
Meta-Llama-3.1-Quantized Collection Collection of quantized Llama 3.1 models (8B & 70B versions for now), using bitsandbites. • 4 items • Updated Aug 28 • 1
view article Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 • 62