Description

These are GGUF model format files for the rhysjones/Phi-3-mini-mango-1 Phi-3 4k model.

Conversion process

The useful conversion script GGUF-n-Go by thesven was used along with llama.cpp to generate the different quantized sizes for the model.

GGUF

Model size

3.82B params

Architecture

phi3

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Examples

Inference API (serverless) has been turned off for this model.

Base model

Quantized

(1)

this model