Favorite Low-End Models

Crataco 's Collections

updated Aug 3

These quantized models have a smaller memory footprint, but acceptable quality.

legraphista/gemma-2-2b-it-IMat-GGUF

Text Generation • Updated Jul 31 • 234 • 1

Note Claims to outperform GPT-3.5. MMLU Pro scores of the Open LLM Leaderboard puts it slightly below OpenHermes 2.5. 3 GB RAM: Go for IQ3_M quant (tested with my phone)
bartowski/Phi-3.1-mini-4k-instruct-GGUF

Text Generation • Updated Aug 3 • 4.99k • 41

Note Outperforms Gemma-2 2B in terms of MMLU Pro score (33.58% versus 17.22% according to the Open LLM Leaderboard), but is slightly bigger at 3.8B parameters vs. 2.6B, and not so great at roleplay.
bartowski/Gemma-2-9B-It-SPPO-Iter3-GGUF

Text Generation • Updated Jul 15 • 3.95k • 53

Note My goto "jack-of-all-trades" model, whether it be my studying or roleplaying partner.