Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis Paper • 2410.08261 • Published Oct 10 • 49
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 28 days ago • 482
Llama3-8B-1.58 Collection A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture! • 3 items • Updated Sep 14 • 12
view article Article Probabilistic Fractal Activation Function (P-FAF) and Its Advantages Over Traditional Word Vectorization By TuringsSolutions • Feb 8 • 5
Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute Paper • 2401.00711 • Published Jan 1 • 2
Arcee's MergeKit: A Toolkit for Merging Large Language Models Paper • 2403.13257 • Published Mar 20 • 20
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking Paper • 2403.09629 • Published Mar 14 • 73
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect Paper • 2403.03853 • Published Mar 6 • 62
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27 • 603
SliceGPT: Compress Large Language Models by Deleting Rows and Columns Paper • 2401.15024 • Published Jan 26 • 69
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 258
StableLM (.gguf) Collection Models based on StableLM Models by Stability AI • 19 items • Updated Nov 27, 2023 • 3
Honorable mentions Collection Some models I've made and I liked but isn't part of a serie. • 10 items • Updated Feb 4 • 6