Reasoning - a donatoni Collection

donatoni 's Collections

Reasoning

updated Mar 15

Teaching Large Language Models to Reason with Reinforcement Learning

Paper • 2403.04642 • Published Mar 7 • 46