GGUF
Inference Endpoints
imatrix
ThomasBaruzier's picture
Upload Llama-3.1-Minitron-4B-Width-Base-Q3_K_S.gguf
2e66c0b verified