STU-PID: Steering Token Usage via PID Controller for Efficient Large Language Model Reasoning

Aryasomayajula Ram Bharadwaj

arXiv:2506.18831·cs.CL·April 14, 2026

STU-PID: Steering Token Usage via PID Controller for Efficient Large Language Model Reasoning

Aryasomayajula Ram Bharadwaj

PDF

TL;DR

STUPID uses a PID controller to dynamically steer token usage in large language models, reducing redundancy and computational costs while improving reasoning accuracy.

Contribution

It introduces a training-free, adaptive method combining a classifier and PID control to optimize reasoning efficiency during inference.

Findings

01

Achieves 6% accuracy improvement on GSM8K

02

Reduces token usage by 32%

03

Outperforms static steering baselines

Abstract

Large Language Models employing extended chain-of-thought (CoT) reasoning often suffer from the overthinking phenomenon, generating excessive and redundant reasoning steps that increase computational costs while potentially degrading performance. While recent work has explored static steering approaches to mitigate this issue, they lack the adaptability to dynamically adjust intervention strength based on real-time reasoning quality. We propose STUPID (Steering Token Usage via PID controller), a novel training-free method that employs a PID controller to dynamically modulate activation steering strength during inference. Our approach combines a chunk-level classifier for detecting redundant reasoning patterns with a PID control mechanism that adaptively adjusts steering intensity based on the predicted redundancy probability. Experimental evaluation on GSM8K demonstrates that STUPID…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.