GGUF
Inference Endpoints
imatrix
Llama-3.1-Minitron-4B-Width-Base-GGUF / Llama-3.1-Minitron-4B-Width-Base-IQ2_XXS.gguf

Commit History

Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_XXS.gguf
dd6d82d
verified

ThomasBaruzier commited on