QuantFactory/Llama-3.1-Minitron-4B-Width-Base-GGUF
GGUF
Inference Endpoints
arxiv: 2408.11796
arxiv: 2009.03300
arxiv: 2407.14679
License: nvidia-open-model-license
main
Llama-3.1-Minitron-4B-Width-Base-GGUF
1 contributor
History: 16 commits
Latest commit: 3d99350 (verified) by aashish1904, 3 months ago: Upload Llama-3.1-Minitron-4B-Width-Base.Q4_K_S.gguf with huggingface_hub
| File | Size | Storage | Last commit | Updated |
|---|---|---|---|---|
| .gitattributes | 2.64 kB | | Upload Llama-3.1-Minitron-4B-Width-Base.Q4_K_S.gguf with huggingface_hub | 3 months ago |
| Llama-3.1-Minitron-4B-Width-Base.Q2_K.gguf | 1.84 GB | LFS | Upload Llama-3.1-Minitron-4B-Width-Base.Q2_K.gguf with huggingface_hub | 3 months ago |
| Llama-3.1-Minitron-4B-Width-Base.Q3_K_L.gguf | 2.46 GB | LFS | Upload Llama-3.1-Minitron-4B-Width-Base.Q3_K_L.gguf with huggingface_hub | 3 months ago |
| Llama-3.1-Minitron-4B-Width-Base.Q3_K_M.gguf | 2.3 GB | LFS | Upload Llama-3.1-Minitron-4B-Width-Base.Q3_K_M.gguf with huggingface_hub | 3 months ago |
| Llama-3.1-Minitron-4B-Width-Base.Q3_K_S.gguf | 2.1 GB | LFS | Upload Llama-3.1-Minitron-4B-Width-Base.Q3_K_S.gguf with huggingface_hub | 3 months ago |
| Llama-3.1-Minitron-4B-Width-Base.Q4_0.gguf | 2.65 GB | LFS | Upload Llama-3.1-Minitron-4B-Width-Base.Q4_0.gguf with huggingface_hub | 3 months ago |
| Llama-3.1-Minitron-4B-Width-Base.Q4_1.gguf | 2.91 GB | LFS | Upload Llama-3.1-Minitron-4B-Width-Base.Q4_1.gguf with huggingface_hub | 3 months ago |
| Llama-3.1-Minitron-4B-Width-Base.Q4_K_M.gguf | 2.78 GB | LFS | Upload Llama-3.1-Minitron-4B-Width-Base.Q4_K_M.gguf with huggingface_hub | 3 months ago |
| Llama-3.1-Minitron-4B-Width-Base.Q4_K_S.gguf | 2.66 GB | LFS | Upload Llama-3.1-Minitron-4B-Width-Base.Q4_K_S.gguf with huggingface_hub | 3 months ago |
| Llama-3.1-Minitron-4B-Width-Base.Q5_0.gguf | 3.16 GB | LFS | Upload Llama-3.1-Minitron-4B-Width-Base.Q5_0.gguf with huggingface_hub | 3 months ago |
| Llama-3.1-Minitron-4B-Width-Base.Q5_1.gguf | 3.42 GB | LFS | Upload Llama-3.1-Minitron-4B-Width-Base.Q5_1.gguf with huggingface_hub | 3 months ago |
| Llama-3.1-Minitron-4B-Width-Base.Q5_K_M.gguf | 3.23 GB | LFS | Upload Llama-3.1-Minitron-4B-Width-Base.Q5_K_M.gguf with huggingface_hub | 3 months ago |
| Llama-3.1-Minitron-4B-Width-Base.Q5_K_S.gguf | 3.16 GB | LFS | Upload Llama-3.1-Minitron-4B-Width-Base.Q5_K_S.gguf with huggingface_hub | 3 months ago |
| Llama-3.1-Minitron-4B-Width-Base.Q6_K.gguf | 3.71 GB | LFS | Upload Llama-3.1-Minitron-4B-Width-Base.Q6_K.gguf with huggingface_hub | 3 months ago |
| Llama-3.1-Minitron-4B-Width-Base.Q8_0.gguf | 4.8 GB | LFS | Upload Llama-3.1-Minitron-4B-Width-Base.Q8_0.gguf with huggingface_hub | 3 months ago |
| README.md | 6.19 kB | | Upload README.md with huggingface_hub | 3 months ago |
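The listing offers fourteen quantization levels, so a reader usually has to pick one file. The following is a minimal sketch of one way to do that, assuming a simple rule of "largest file that fits a size budget". The sizes are copied from the listing; the helper names `gguf_filename` and `largest_quant_within` are hypothetical, and the budget rule is an illustrative assumption rather than a recommendation from the repository. On-disk size is also only a lower bound on memory use at inference time.

```python
from typing import Optional

REPO_ID = "QuantFactory/Llama-3.1-Minitron-4B-Width-Base-GGUF"
BASE_NAME = "Llama-3.1-Minitron-4B-Width-Base"

# File sizes in GB, copied from the repository file listing.
QUANT_SIZES_GB = {
    "Q2_K": 1.84,
    "Q3_K_L": 2.46,
    "Q3_K_M": 2.3,
    "Q3_K_S": 2.1,
    "Q4_0": 2.65,
    "Q4_1": 2.91,
    "Q4_K_M": 2.78,
    "Q4_K_S": 2.66,
    "Q5_0": 3.16,
    "Q5_1": 3.42,
    "Q5_K_M": 3.23,
    "Q5_K_S": 3.16,
    "Q6_K": 3.71,
    "Q8_0": 4.8,
}


def gguf_filename(quant: str) -> str:
    """Build the file name used in this repository for a quantization level."""
    if quant not in QUANT_SIZES_GB:
        raise KeyError(f"unknown quantization level: {quant}")
    return f"{BASE_NAME}.{quant}.gguf"


def largest_quant_within(budget_gb: float) -> Optional[str]:
    """Return the quantization level with the largest file that is no bigger
    than budget_gb, or None if no file fits. (Hypothetical selection rule.)"""
    fitting = {q: s for q, s in QUANT_SIZES_GB.items() if s <= budget_gb}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


if __name__ == "__main__":
    choice = largest_quant_within(3.0)
    if choice is not None:
        print(REPO_ID, gguf_filename(choice))
```

The chosen file name can then be passed, together with `REPO_ID`, to `hf_hub_download` from the `huggingface_hub` package to fetch that single file instead of cloning the whole repository.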