Article • Improving Hugging Face Training Efficiency Through Packing with Flash Attention • about 1 month ago • 19
🔍 Daily Picks in Interpretability & Analysis of LMs Collection Outstanding research in interpretability and evaluation of language models, summarized • 70 items • Updated 9 days ago • 84
Article • Introducing RWKV — An RNN with the advantages of a transformer • May 15, 2023 • 12
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation Paper • 2406.13663 • Published Jun 19 • 7
A Simple and Effective L_2 Norm-Based Strategy for KV Cache Compression Paper • 2406.11430 • Published Jun 17 • 23
Article • The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models • Jan 29 • 12