GGUF
Inference Endpoints
imatrix
ThomasBaruzier's picture
Upload Llama-3.1-Minitron-4B-Width-Base-IQ4_NL.gguf
7600cbb verified