e1: Learning Adaptive Control of Reasoning Effort
Michael Kleinman, Matthew Trager, Alessandro Achille, Wei Xia, Stefano Soatto

TL;DR
This paper introduces Adaptive Effort Control, a reinforcement learning method that allows AI models to dynamically allocate reasoning effort based on user preferences, improving cost-accuracy tradeoffs and efficiency across various model sizes.
Contribution
It presents a novel self-adaptive reinforcement learning approach enabling models to adjust reasoning effort proportionally to task difficulty without prior tuning.
Findings
2-3x reduction in reasoning length while maintaining performance
Better cost-accuracy tradeoff curves compared to standard methods
Effective across models from 1.5B to 32B parameters
Abstract
Increasing the thinking budget of AI models can significantly improve accuracy, but not all questions warrant the same amount of reasoning. Users may prefer to allocate different amounts of reasoning effort depending on how they value output quality versus latency and cost. To leverage this tradeoff effectively, users need fine-grained control over the amount of thinking used for a particular query, but few approaches enable such control. Existing methods require users to specify the absolute number of desired tokens, but this requires knowing the difficulty of the problem beforehand to appropriately set the token budget for a query. To address these issues, we propose Adaptive Effort Control, a self-adaptive reinforcement learning method that trains models to use a user-specified fraction of tokens relative to the current average chain-of-thought length for each query. This approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Multimodal Machine Learning Applications
