Edit model card

iMatrix GGUFs for https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

iMat generated using Kalomaze's groups_merged.txt

Downloads last month
233
GGUF
Model size
70.6B params
Architecture
llama
Inference Examples
Inference API (serverless) has been turned off for this model.

Model tree for MarsupialAI/Llama-3.1-Nemotron-70B-Instruct_iMat_GGUF

Quantized
(85)
this model

Dataset used to train MarsupialAI/Llama-3.1-Nemotron-70B-Instruct_iMat_GGUF