GGUF
Inference Endpoints
imatrix
ThomasBaruzier commited on
Commit
a8b6592
1 Parent(s): 1894899

Upload Llama-3.1-Minitron-4B-Width-Base-Q4_0_4_8.gguf

Browse files
.gitattributes CHANGED
@@ -63,3 +63,4 @@ Llama-3.1-Minitron-4B-Width-Base-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
63
  Llama-3.1-Minitron-4B-Width-Base-Q5_1.gguf filter=lfs diff=lfs merge=lfs -text
64
  Llama-3.1-Minitron-4B-Width-Base-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
65
  Llama-3.1-Minitron-4B-Width-Base-Q4_0_4_4.gguf filter=lfs diff=lfs merge=lfs -text
 
 
63
  Llama-3.1-Minitron-4B-Width-Base-Q5_1.gguf filter=lfs diff=lfs merge=lfs -text
64
  Llama-3.1-Minitron-4B-Width-Base-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
65
  Llama-3.1-Minitron-4B-Width-Base-Q4_0_4_4.gguf filter=lfs diff=lfs merge=lfs -text
66
+ Llama-3.1-Minitron-4B-Width-Base-Q4_0_4_8.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3.1-Minitron-4B-Width-Base-Q4_0_4_8.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ebece1c16f8ff68d854ce47d2dde5c96d8dcfc4d8f3f5794bc5583f08c4c6843
3
+ size 2648521376