Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

Yuanda Xu; Hejian Sang; Zhengze Zhou; Ran He; Zhipeng Wang

arXiv:2602.21420·cs.LG·February 26, 2026

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

Yuanda Xu, Hejian Sang, Zhengze Zhou, Ran He, Zhipeng Wang

PDF

Open Access

TL;DR

This paper introduces ACE, a novel asymmetric confidence penalty for reinforcement learning in language models, which dynamically adjusts error correction to improve reasoning accuracy and diversity.

Contribution

The paper proposes ACE, a confidence-aware error penalty that selectively moderates overconfident errors, enhancing RL training for large language models.

Findings

01

ACE improves Pass@k accuracy across multiple models and benchmarks.

02

ACE complements existing methods without disrupting their core mechanisms.

03

Experimental results show consistent performance gains in reasoning tasks.

Abstract

Reinforcement Learning with Verifiable Rewards (RLVR) has become the leading paradigm for enhancing reasoning in Large Language Models (LLMs). However, standard RLVR algorithms suffer from a well-documented pathology: while they improve Pass@1 accuracy through sharpened sampling, they simultaneously narrow the model's reasoning boundary and reduce generation diversity. We identify a root cause that existing methods overlook: the uniform penalization of errors. Current approaches -- whether data-filtering methods that select prompts by difficulty, or advantage normalization schemes -- treat all incorrect rollouts within a group identically. We show that this uniformity allows overconfident errors (incorrect reasoning paths that the RL process has spuriously reinforced) to persist and monopolize probability mass, ultimately suppressing valid exploratory trajectories. To address this, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Natural Language Processing Techniques