Word Salad Chopper: Reasoning Models Waste A Ton Of Decoding Budget On Useless Repetitions, Self-Knowingly

Wenya Xie; Shaochen (Henry) Zhong; Hoang Anh Duy Le; Zhaozhuo Xu; Jianwen Xie; Zirui Liu

arXiv:2511.00536·cs.CL·November 4, 2025

Word Salad Chopper: Reasoning Models Waste A Ton Of Decoding Budget On Useless Repetitions, Self-Knowingly

Wenya Xie, Shaochen (Henry) Zhong, Hoang Anh Duy Le, Zhaozhuo Xu, Jianwen Xie, Zirui Liu

PDF

Open Access 1 Video

TL;DR

This paper introduces WordSaladChopper, a lightweight method to detect and remove useless repetitive tokens in large reasoning models, significantly reducing decoding costs without sacrificing output quality.

Contribution

It presents a novel on-the-fly detection technique for word salad behavior in LRMs and a simple chopping method to improve decoding efficiency.

Findings

01

Detects word salad with a linear classifier based on hidden states.

02

Reduces decoding length with minimal quality loss.

03

Achieves substantial cost savings in reasoning tasks.

Abstract

Large Reasoning Models (LRMs) are often bottlenecked by the high cost of output tokens. We show that a significant portion of these tokens are useless self-repetitions - what we call "word salad" - that exhaust the decoding budget without adding value. Interestingly, we observe that LRMs are self-aware when trapped in these loops: the hidden states of <\n\n> tokens trailing each reasoning chunk exhibit patterns that allow us to detect word salad behavior on-the-fly via a single-layer linear classifier. Once detected, a simple chop appended by a straightforward regeneration prompt yields substantial length savings with minimal quality loss. Our work offers WordSaladChopper (WSC) - a lightweight, turnkey component for LRM that is minimally invasive to its reasoning trajectory by only removing semantically redundant tokens. Given its low overhead, strong savings, and the lack of semantic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Word Salad Chopper: Reasoning Models Waste A Ton Of Decoding Budget On Useless Repetitions, Self-Knowingly· underline

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)