Lost in the Noise: How Reasoning Models Fail with Contextual Distractors

Seongyun Lee; Yongrae Jo; Minju Seo; Moontae Lee; Minjoon Seo

arXiv:2601.07226·cs.AI·January 13, 2026

Lost in the Noise: How Reasoning Models Fail with Contextual Distractors

Seongyun Lee, Yongrae Jo, Minju Seo, Moontae Lee, Minjoon Seo

PDF

Open Access

TL;DR

This paper introduces NoisyBench, a benchmark revealing that current reasoning models struggle significantly with noisy contexts, and proposes RARE to improve robustness by encouraging helpful information identification.

Contribution

The paper presents NoisyBench for evaluating model robustness against noise and introduces RARE, a novel training method that enhances model resilience to distractors.

Findings

01

State-of-the-art models drop up to 80% performance with distractors.

02

Agentic workflows can amplify errors by over-trusting noisy outputs.

03

Increased test-time computation worsens performance in noisy settings.

Abstract

Recent advances in reasoning models and agentic AI systems have led to an increased reliance on diverse external information. However, this shift introduces input contexts that are inherently noisy, a reality that current sanitized benchmarks fail to capture. We introduce NoisyBench, a comprehensive benchmark that systematically evaluates model robustness across 11 datasets in RAG, reasoning, alignment, and tool-use tasks against diverse noise types, including random documents, irrelevant chat histories, and hard negative distractors. Our evaluation reveals a catastrophic performance drop of up to 80% in state-of-the-art models when faced with contextual distractors. Crucially, we find that agentic workflows often amplify these errors by over-trusting noisy tool outputs, and distractors can trigger emergent misalignment even without adversarial intent. We find that prompting, context…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI