Correctness-Optimized Residual Activation Lens (CORAL): Transferrable and Calibration-Aware Inference-Time Steering

Miranda Muqing Miao; Young-Min Cho; Lyle Ungar

arXiv:2602.06022·cs.LG·February 6, 2026

Correctness-Optimized Residual Activation Lens (CORAL): Transferrable and Calibration-Aware Inference-Time Steering

Miranda Muqing Miao, Young-Min Cho, Lyle Ungar

PDF

Open Access

TL;DR

CORAL is an inference-time steering method that uses regularized probes to extract correctness signals from model internals, significantly improving accuracy and calibration across multiple models and benchmarks without retraining.

Contribution

Introduces CORAL, a novel inference-time steering technique leveraging regularized probes to enhance correctness detection and calibration in large language models.

Findings

01

Improves accuracy by 10% on average across models

02

Reduces expected calibration error (ECE) by 50%

03

Transfers gains to multiple benchmarks without retraining

Abstract

Large language models (LLMs) exhibit persistent miscalibration, especially after instruction tuning and preference alignment. Modified training objectives can improve calibration, but retraining is expensive. Inference-time steering offers a lightweight alternative, yet most existing methods optimize proxies for correctness rather than correctness itself. We introduce CORAL (Correctness-Optimized Residual Activation Lens), a regularized inference-time steering method that captures distributed correctness signals from model internal activations using weight-decay MLP probes. We evaluate CORAL across three 7B-parameter models and find that it consistently improves accuracy by 10\% and expected calibration error (ECE) by 50\% on average. We additionally demonstrate that these gains transfer without retraining to the complete published test sets of four held-out benchmarks (ARC-Challenge,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Machine Learning in Materials Science · Explainable Artificial Intelligence (XAI)