Does More Inference-Time Compute Really Help Robustness?

Tong Wu; Chong Xiang; Jiachen T. Wang; Weichen Yu; Chawin Sitawarin; Vikash Sehwag; Prateek Mittal

arXiv:2507.15974·cs.AI·July 23, 2025

Does More Inference-Time Compute Really Help Robustness?

Tong Wu, Chong Xiang, Jiachen T. Wang, Weichen Yu, Chawin Sitawarin, Vikash Sehwag, Prateek Mittal

PDF

Open Access

TL;DR

Increasing inference-time compute can enhance robustness in large models, but exposing reasoning steps to adversaries can reverse this benefit, highlighting the importance of security considerations.

Contribution

This work demonstrates that inference-time scaling benefits are context-dependent and reveals security risks when reasoning steps are accessible to attackers.

Findings

01

Inference-time scaling improves robustness in open-source models.

02

Explicit reasoning steps reduce model robustness under attack.

03

Security risks increase when reasoning steps are exposed.

Abstract

Recently, Zaremba et al. demonstrated that increasing inference-time computation improves robustness in large proprietary reasoning LLMs. In this paper, we first show that smaller-scale, open-source models (e.g., DeepSeek R1, Qwen3, Phi-reasoning) can also benefit from inference-time scaling using a simple budget forcing strategy. More importantly, we reveal and critically examine an implicit assumption in prior work: intermediate reasoning steps are hidden from adversaries. By relaxing this assumption, we identify an important security risk, intuitively motivated and empirically verified as an inverse scaling law: if intermediate reasoning steps become explicitly accessible, increased inference-time computation consistently reduces model robustness. Finally, we discuss practical scenarios where models with hidden reasoning chains are still vulnerable to attacks, such as models with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems · Bayesian Modeling and Causal Inference