Edit Models filters

Inference status

Misc

Inference Endpoints

text-generation-inference

AutoTrain Compatible

4-bit precision

Misc with no match

text-embeddings-inference

8-bit precision

Carbon Emissions

Mixture of Experts

Models

82

Full-text search

Active filters: llama.cpp

MrOvkill/gemma-2-inference-endpoint-GGUF

Text Generation • Updated Mar 11 • 2

google/gemma-1.1-7b-it-GGUF

Updated Jun 27 • 4 • 21

google/gemma-1.1-2b-it-GGUF

Updated Jun 27 • 11 • 20

HirCoir/openchat-3.5-0106-GGUF

Updated Apr 29 • 77

google/codegemma-7b-GGUF

Text Generation • Updated Jun 27 • 15 • 16

google/codegemma-7b-it-GGUF

Text Generation • Updated Jun 27 • 72 • 50

pacozaa/bonito-gguf

Updated Apr 14 • 3

pmking27/PrathameshLLM-2B-GGUF

Updated Apr 9 • 2.25k • 1

teleprint-me/cyberpunk-valerie-v0.1

Text Generation • Updated Apr 18 • 47 • 1

qwp4w3hyb/Meta-Llama-3-8B-Instruct-iMat-GGUF

Text Generation • Updated Apr 29 • 1.21k • 6

HirCoir/Phi-3-mini-4k-instruct-gguf

Updated Apr 29 • 94

asiansoul/Llama-3-Open-Ko-Linear-8B-GGUF

Updated Apr 28 • 4

mgonzs13/Mistroll-7B-v2.2-GGUF

Text Generation • Updated Apr 29 • 36

HirCoir/openbuddy-mistral2-7b-v20.3-32k-GGUF

Updated May 1 • 54

HirCoir/Phi-3-mini-128k-instruct-GGUF

Updated May 6 • 170

mgonzs13/ladybird-base-7B-v8-GGUF

Text Generation • Updated Apr 29 • 36

google/codegemma-1.1-2b-GGUF

Text Generation • Updated Jun 27 • 10

google/codegemma-1.1-7b-it-GGUF

Text Generation • Updated Jun 27 • 10 • 14

HirCoir/TinyDolphin-2.8-1.1b-GGUF

Updated May 1 • 88 • 2

HirCoir/TinyLlama-1.1B-Chat-v1.0-GGUF

Updated May 1 • 112 • 1

mgonzs13/TextBase-7B-v0.1-GGUF

Text Generation • Updated Jun 11 • 49

QuantFactory/TextBase-7B-v0.1-GGUF

Text Generation • Updated Jun 18 • 232

njwright92/ComicBot_v.2-gguf

Text Generation • Updated Aug 30 • 21

Irathernotsay/qwen2-1.5B-medical_qa-Finetune

Text Generation • Updated Jul 17 • 7

palusi/Qwen2-0.5B-Instruct-GGUF

Updated Jun 27 • 340

XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k

Text Generation • Updated Jul 9 • 21

ruslanmv/Medical-Llama3-v2-Q4_K_M-GGUF

Updated Jun 30 • 14

XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k-GGUF

Text Generation • Updated Jul 9 • 31

XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k-GPTQ

Text Generation • Updated Jul 9 • 5

zhhan/Phi-3-mini-4k-instruct_gguf_derived

Summarization • Updated Jul 2 • 47