GGUF
Inference Endpoints
imatrix

Commit History

Upload Llama-3.1-Minitron-4B-Width-Base-IQ4_NL.gguf
7600cbb
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-Q3_K_M.gguf
755f25b
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ3_XS.gguf
48f3f39
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-Q3_K_L.gguf
cd880fb
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ3_M.gguf
0b8339f
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-Q3_K_S.gguf
2e66c0b
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ3_XXS.gguf
bfc98e0
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ3_S.gguf
67e7392
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_M.gguf
cbba92f
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_S.gguf
217a4b5
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-Q2_K_S.gguf
e735694
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_XS.gguf
3a8eea0
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-Q2_K.gguf
12132ac
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ2_XXS.gguf
dd6d82d
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ1_S.gguf
7a881e3
verified

ThomasBaruzier commited on

Upload Llama-3.1-Minitron-4B-Width-Base-IQ1_M.gguf
1e7170f
verified

ThomasBaruzier commited on