QuantFactory/Llama-3.1-Minitron-4B-Width-Base-GGUF
Tags: GGUF, Inference Endpoints, arxiv:2408.11796, arxiv:2009.03300, arxiv:2407.14679
License: nvidia-open-model-license
Revision: 7b89c22
1 contributor
History: 8 commits
Latest commit: aashish1904, "Upload Llama-3.1-Minitron-4B-Width-Base.Q5_K_M.gguf with huggingface_hub" (7b89c22, verified), 3 months ago
File                                          Scan  Size     LFS  Updated       Last commit
.gitattributes                                Safe  2 kB     no   3 months ago  Upload Llama-3.1-Minitron-4B-Width-Base.Q5_K_M.gguf with huggingface_hub
Llama-3.1-Minitron-4B-Width-Base.Q4_0.gguf    Safe  2.65 GB  yes  3 months ago  Upload Llama-3.1-Minitron-4B-Width-Base.Q4_0.gguf with huggingface_hub
Llama-3.1-Minitron-4B-Width-Base.Q4_1.gguf    Safe  2.91 GB  yes  3 months ago  Upload Llama-3.1-Minitron-4B-Width-Base.Q4_1.gguf with huggingface_hub
Llama-3.1-Minitron-4B-Width-Base.Q4_K_M.gguf  Safe  2.78 GB  yes  3 months ago  Upload Llama-3.1-Minitron-4B-Width-Base.Q4_K_M.gguf with huggingface_hub
Llama-3.1-Minitron-4B-Width-Base.Q5_K_M.gguf  Safe  3.23 GB  yes  3 months ago  Upload Llama-3.1-Minitron-4B-Width-Base.Q5_K_M.gguf with huggingface_hub
Llama-3.1-Minitron-4B-Width-Base.Q6_K.gguf    Safe  3.71 GB  yes  3 months ago  Upload Llama-3.1-Minitron-4B-Width-Base.Q6_K.gguf with huggingface_hub
Llama-3.1-Minitron-4B-Width-Base.Q8_0.gguf    Safe  4.8 GB   yes  3 months ago  Upload Llama-3.1-Minitron-4B-Width-Base.Q8_0.gguf with huggingface_hub
README.md                                     Safe  6.19 kB  no   3 months ago  Upload README.md with huggingface_hub