AdaReasoner: Adaptive Reasoning Enables More Flexible Thinking in Large Language Models
Xiangqi Wang, Yue Huang, Yanbo Wang, Xiaonan Luo, Kehan Guo, Yujun Zhou, Xiangliang Zhang

TL;DR
AdaReasoner is a reinforcement learning-based plugin that adaptively configures reasoning parameters for large language models, improving task-specific performance across diverse reasoning tasks.
Contribution
It introduces AdaReasoner, a novel RL framework for automating adaptive reasoning configurations in LLMs, with theoretical guarantees and broad empirical validation.
Findings
Outperforms standard baselines across six LLMs
Maintains robustness on out-of-distribution tasks
Enhances knowledge-intensive task performance
Abstract
LLMs often need effective configurations, like temperature and reasoning steps, to handle tasks requiring sophisticated reasoning and problem-solving, ranging from joke generation to mathematical reasoning. Existing prompting approaches usually adopt general-purpose, fixed configurations that work 'well enough' across tasks but seldom achieve task-specific optimality. To address this gap, we introduce AdaReasoner, an LLM-agnostic plugin designed for any LLM to automate adaptive reasoning configurations for tasks requiring different types of thinking. AdaReasoner is trained using a reinforcement learning (RL) framework, combining a factorized action space with a targeted exploration strategy, along with a pretrained reward model to optimize the policy model for reasoning configurations with only a few-shot guide. AdaReasoner is backed by theoretical guarantees and experiments of fast…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Text Readability and Simplification
