Hugging Face
neuralmagic/Meta-Llama-3.1-405B-Instruct-quantized.w4a16
Tags: Text Generation, Safetensors, 8 languages, llama, int4, vllm, conversational, compressed-tensors
arxiv: 2210.17323
License: llama3.1
Commit History (branch: main)
9db6306  Update README.md (alexmarques, committed on Oct 10, verified)
d0c9cb9  Updated compression_config to quantization_config (mgoin, committed on Oct 9, verified)
6b753ea  Update README.md (alexmarques, committed on Sep 30, verified)
7d1f72d  Update README.md (alexmarques, committed on Aug 13, verified)
bb83fe4  Update README.md (abhinavnmagic, committed on Aug 13, verified)
423c174  Upload folder using huggingface_hub (abhinavnmagic, committed on Aug 13, verified)
91a872b  Update README.md (abhinavnmagic, committed on Aug 12, verified)
a8c9e50  Update README.md (abhinavnmagic, committed on Aug 9, verified)
2abcd4a  Create README.md (abhinavnmagic, committed on Aug 9, verified)
eadc452  Upload folder using huggingface_hub (abhinavnmagic, committed on Aug 9, verified)
74fef34  initial commit (abhinavnmagic, committed on Aug 9, verified)