NeuReasoner: Towards Explainable, Controllable, and Unified Reasoning via Mixture-of-Neurons
Haonan Dong, Kehan Jiang, Haoran Ye, Wenhao Zhu, Zhaolu Kang, Guojie Song

TL;DR
NeuReasoner introduces a white-box, neuron-based framework for explainable, controllable, and unified reasoning in large models, addressing multiple failure modes and improving performance across benchmarks.
Contribution
It identifies key neurons linked to reasoning failures and develops NeuReasoner, a framework that integrates failure detection and self-correction for enhanced reasoning.
Findings
Achieves up to 27.0% performance improvement.
Reduces token consumption by up to 63.3%.
Demonstrates effectiveness across six benchmarks and multiple models.
Abstract
Large Reasoning Models (LRMs) have recently achieved remarkable success in complex reasoning tasks. However, closer scrutiny reveals persistent failure modes compromising performance and cost: I) Intra-step level, marked by calculation or derivation errors; II) Inter-step level, involving oscillation and stagnation; and III) Instance level, causing maladaptive over-thinking. Existing endeavors target isolated levels without unification, while their black-box nature and reliance on RL hinder explainability and controllability. To bridge these gaps, we conduct an in-depth white-box analysis, identifying key neurons (Mixture of Neurons, MoN) and their fluctuation patterns associated with distinct failures. Building upon these insights, we propose NeuReasoner, an explainable, controllable, and unified reasoning framework driven by MoN. Technically, NeuReasoner integrates lightweight MLPs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
