LLM decoding Chain-of-Thought Reasoning Without Prompting Paper • 2402.10200 • Published Feb 15 • 99 PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking Paper • 2410.12375 • Published 20 days ago • 2
PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking Paper • 2410.12375 • Published 20 days ago • 2