Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published 15 days ago • 130
Phantom of Latent for Large Language and Vision Models Paper • 2409.14713 • Published 24 days ago • 27
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Paper • 2409.12183 • Published 28 days ago • 36
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Paper • 2409.08264 • Published Sep 12 • 42
MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery Paper • 2409.05591 • Published Sep 9 • 27
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Paper • 2407.14057 • Published Jul 19 • 43