DeepSeek LLM: Scaling Open-Source Language Models with Longtermism Paper • 2401.02954 • Published Jan 5 • 40
Perspectives on the State and Future of Deep Learning -- 2023 Paper • 2312.09323 • Published Dec 7, 2023 • 5
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization Paper • 2405.15071 • Published May 23 • 37
Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning Paper • 2407.10718 • Published Jul 15 • 17
LAB-Bench: Measuring Capabilities of Language Models for Biology Research Paper • 2407.10362 • Published Jul 14 • 4
SciCode: A Research Coding Benchmark Curated by Scientists Paper • 2407.13168 • Published Jul 18 • 13
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher Paper • 2407.20183 • Published Jul 29 • 37
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22 • 116
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models Paper • 2409.11136 • Published Sep 17 • 21