rubbyninja

AI & ML interests

None yet

Organizations

None yet

rubbyninja's activity

upvoted a paper 3 days ago

GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models

Paper • 2410.05229 • Published 9 days ago • 14

upvoted a paper 17 days ago

Aligning Machine and Human Visual Representations across Abstraction Levels

Paper • 2409.06509 • Published Sep 10 • 1

upvoted a paper 25 days ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published 27 days ago • 131

upvoted a paper 29 days ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6 • 33

upvoted 2 papers about 1 month ago

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Paper • 2407.21787 • Published Jul 31 • 3

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14 • 72

upvoted a paper about 2 months ago

Prompt Cache: Modular Attention Reuse for Low-Latency Inference

Paper • 2311.04934 • Published Nov 7, 2023 • 28

upvoted 2 papers 2 months ago

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published May 7 • 13

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Paper • 2401.06066 • Published Jan 11 • 42

upvoted a paper 3 months ago

STaR: Bootstrapping Reasoning With Reasoning

Paper • 2203.14465 • Published Mar 28, 2022 • 2