long context LLM - a ZihanWang99 Collection

long context LLM

updated Feb 19

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models

Paper • 2401.06951 • Published Jan 13 • 24

Extending LLMs' Context Window with 100 Samples

Paper • 2401.07004 • Published Jan 13 • 14

Soaring from 4K to 400K: Extending LLM's Context with Activation Beacon

Paper • 2401.03462 • Published Jan 7 • 26

The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry

Paper • 2402.04347 • Published Feb 6 • 13

LongAlign: A Recipe for Long Context Alignment of Large Language Models

Paper • 2401.18058 • Published Jan 31 • 21