Data Engineering for Scaling Language Models to 128K Context • Paper • 2402.10171 • Published Feb 15 • 21
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens • Paper • 2402.13753 • Published Feb 21 • 111
Tulu V2 Suite • Collection • The set of models associated with the paper "Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2" • 19 items • Updated Sep 25 • 43
🚀GGUF • Collection • Llama.cpp compatible models, can be used on CPUs and GPUs! • 838 items • Updated about 13 hours ago • 34
BLING Models • Collection • Small CPU-based RAG-optimized, instruct-following 1B-3B parameter models • 27 items • Updated 9 days ago • 25
Recent models: last 100 repos, sorted by creation date • Collection • The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31 • 501