The Illusion of Reasoning: Exposing Evasive Data Contamination in LLMs via Zero-CoT Truncation

Yifan Lan; Yuanpu Cao; Hanyu Wang; Lu Lin; Jinghui Chen

arXiv:2605.21856·cs.LG·May 22, 2026

The Illusion of Reasoning: Exposing Evasive Data Contamination in LLMs via Zero-CoT Truncation

Yifan Lan, Yuanpu Cao, Hanyu Wang, Lu Lin, Jinghui Chen

PDF

1 Repo

TL;DR

This paper introduces Zero-CoT Probe, a novel black-box method that detects evasive data contamination in large language models by truncating reasoning steps and comparing performance on original and perturbed datasets.

Contribution

It presents a new detection technique for stealthy data contamination in LLMs, including a metric for contamination severity and a publicly available implementation.

Findings

01

ZCP effectively detects both direct and evasive contamination.

02

It distinguishes memorization from problem-solving abilities.

03

The method is validated on various contaminated models.

Abstract

Large language models (LLMs) have demonstrated impressive reasoning abilities across a wide range of tasks, but data contamination undermines the objective evaluation of these capabilities. This problem is further exacerbated by malicious model publishers who use evasive, or indirect, contamination strategies, such as paraphrasing benchmark data to evade existing detection methods and artificially boost leaderboard performance. Current approaches struggle to reliably detect such stealthy contamination. In this work, we uncover a critical phenomenon: a model's generated reasoning steps actively mask its underlying memorization. Inspired by this, we propose the Zero-CoT Probe (ZCP), a novel black-box detection method that deliberately truncates the entire Chain-of-Thought (CoT) process to expose latent shortcut mappings. To further isolate memorization from the model's intrinsic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Yifan-Lan/zero-cot-probe
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.