Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
Paper
• 2403.09629
• Published
• 79
V-STaR: Training Verifiers for Self-Taught Reasoners
Paper
• 2402.06457
• Published
• 9
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning
Paper
• 2406.12050
• Published
• 19
Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents
Paper
• 2408.07199
• Published
• 22
Training Language Models to Self-Correct via Reinforcement Learning
Paper
• 2409.12917
• Published
• 140
Stream of Search (SoS): Learning to Search in Language
Paper
• 2404.03683
• Published
• 30
Let's Verify Step by Step
Paper
• 2305.20050
• Published
• 11
STaR: Bootstrapping Reasoning With Reasoning
Paper
• 2203.14465
• Published
• 9
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning
Paper
• 2410.02884
• Published
• 54
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Paper
• 2502.05171
• Published
• 152