GGUF
Inference Endpoints
imatrix

Commit History

Upload Llama-3.1-Minitron-4B-Width-Base-IQ3_XXS.gguf
bfc98e0
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ3_S.gguf
67e7392
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_M.gguf
cbba92f
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_S.gguf
217a4b5
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-Q2_K_S.gguf
e735694
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_XS.gguf
3a8eea0
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-Q2_K.gguf
12132ac
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_XXS.gguf
dd6d82d
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ1_S.gguf
7a881e3
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ1_M.gguf
1e7170f
verified

ThomasBaruzier commited on