No Need for Explanations: LLMs can implicitly learn from mistakes in-context

Lisa Alazraki; Maximilian Mozes; Jon Ander Campos; Tan Yi-Chern; Marek Rei; Max Bartolo

arXiv:2502.08550·cs.CL·September 23, 2025

No Need for Explanations: LLMs can implicitly learn from mistakes in-context

Lisa Alazraki, Maximilian Mozes, Jon Ander Campos, Tan Yi-Chern, Marek Rei, Max Bartolo

PDF

Open Access 1 Video

TL;DR

This paper reveals that Large Language Models perform better in reasoning tasks when they are not provided with explicit rationales, suggesting they can learn effectively from mistakes implicitly without detailed explanations.

Contribution

The study challenges the assumption that explicit rationales are necessary, showing models excel with minimal context and highlighting the potential of implicit learning from errors.

Findings

01

Models outperform chain-of-thought prompting without rationales

02

Explicit rationales can over-constrain models and reduce learning benefits

03

Incorrect answers alone help models learn more effectively

Abstract

Showing incorrect answers to Large Language Models (LLMs) is a popular strategy to improve their performance in reasoning-intensive tasks. It is widely assumed that, in order to be helpful, the incorrect answers must be accompanied by comprehensive rationales, explicitly detailing where the mistakes are and how to correct them. However, in this work we present a counterintuitive finding: we observe that LLMs perform better in math reasoning tasks when these rationales are eliminated from the context and models are left to infer on their own what makes an incorrect answer flawed. This approach also substantially outperforms chain-of-thought prompting in our evaluations. These results are consistent across LLMs of different sizes and varying reasoning abilities. To gain an understanding of why LLMs learn from mistakes more effectively without explicit corrective rationales, we perform a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No Need for Explanations: LLMs can implicitly learn from mistakes in-context· underline

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Imbalanced Data Classification Techniques