No Need for Explanations: LLMs can implicitly learn from mistakes in-context
Lisa Alazraki, Maximilian Mozes, Jon Ander Campos, Tan Yi-Chern, Marek Rei, Max Bartolo

TL;DR
This paper reveals that Large Language Models perform better in reasoning tasks when they are not provided with explicit rationales, suggesting they can learn effectively from mistakes implicitly without detailed explanations.
Contribution
The study challenges the assumption that explicit rationales are necessary, showing models excel with minimal context and highlighting the potential of implicit learning from errors.
Findings
Models outperform chain-of-thought prompting without rationales
Explicit rationales can over-constrain models and reduce learning benefits
Incorrect answers alone help models learn more effectively
Abstract
Showing incorrect answers to Large Language Models (LLMs) is a popular strategy to improve their performance in reasoning-intensive tasks. It is widely assumed that, in order to be helpful, the incorrect answers must be accompanied by comprehensive rationales, explicitly detailing where the mistakes are and how to correct them. However, in this work we present a counterintuitive finding: we observe that LLMs perform better in math reasoning tasks when these rationales are eliminated from the context and models are left to infer on their own what makes an incorrect answer flawed. This approach also substantially outperforms chain-of-thought prompting in our evaluations. These results are consistent across LLMs of different sizes and varying reasoning abilities. To gain an understanding of why LLMs learn from mistakes more effectively without explicit corrective rationales, we perform a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Imbalanced Data Classification Techniques
