Mixture-of-Agents Enhances Large Language Model Capabilities Paper • 2406.04692 • Published Jun 7 • 55
The Prompt Report: A Systematic Survey of Prompting Techniques Paper • 2406.06608 • Published Jun 6 • 53
FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models Paper • 2403.07747 • Published Mar 12 • 1
A Careful Examination of Large Language Model Performance on Grade School Arithmetic Paper • 2405.00332 • Published May 1 • 30
GiT: Towards Generalist Vision Transformer through Universal Language Interface Paper • 2403.09394 • Published Mar 14 • 25
Premise Order Matters in Reasoning with Large Language Models Paper • 2402.08939 • Published Feb 14 • 25
Computing Power and the Governance of Artificial Intelligence Paper • 2402.08797 • Published Feb 13 • 11
PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models Paper • 2402.01118 • Published Feb 2 • 29
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models Paper • 2401.15947 • Published Jan 29 • 48
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering Paper • 1809.09600 • Published Sep 25, 2018 • 2
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning Paper • 2301.13688 • Published Jan 31, 2023 • 8
HellaSwag: Can a Machine Really Finish Your Sentence? Paper • 1905.07830 • Published May 19, 2019 • 4