Self-Correcting Large Language Models: Generation vs. Multiple Choice

Hossein A. Rahmani; Satyapriya Krishna; Xi Wang; Mohammadmehdi Naghiaei; Emine Yilmaz

arXiv:2511.09381·cs.CL·November 13, 2025

Self-Correcting Large Language Models: Generation vs. Multiple Choice

Hossein A. Rahmani, Satyapriya Krishna, Xi Wang, Mohammadmehdi Naghiaei, Emine Yilmaz

PDF

Open Access

TL;DR

This paper systematically compares self-correction in large language models across open-ended generation and multiple-choice tasks, revealing distinct behaviors and implications for designing effective correction mechanisms.

Contribution

It provides a comprehensive analysis of how self-correction varies between generation and multiple-choice paradigms in LLMs, highlighting the importance of task structure.

Findings

01

Generation benefits from reinterpretation and refinement.

02

Multiple-choice leverages clear solution boundaries.

03

Different failure modes observed in each paradigm.

Abstract

Large language models have recently demonstrated remarkable abilities to self-correct their responses through iterative refinement, often referred to as self-consistency or self-reflection. However, the dynamics of this self-correction mechanism may differ substantially depending on whether the model is tasked with open-ended text generation or with selecting the most appropriate response from multiple predefined options. In this paper, we conduct a systematic investigation of these two paradigms by comparing performance trends and error-correction behaviors across various natural language understanding and reasoning tasks, covering language models of different scales and families. Our experimental results reveal distinct patterns of improvement and failure modes: \textit{While open-ended generation often benefits from the flexibility of re-interpretation and compositional refinement,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Explainable Artificial Intelligence (XAI)