An Investigation of Neuron Activation as a Unified Lens to Explain Chain-of-Thought Eliciting Arithmetic Reasoning of LLMs
Daking Rai, Ziyu Yao

TL;DR
This paper explores how neuron activation in large language models explains their arithmetic reasoning abilities with Chain-of-Thought prompts, offering a unified perspective on prior empirical observations.
Contribution
It introduces a neuron activation-based approach to explain LLM reasoning, specifically identifying reasoning neurons in feed-forward layers using GPT-4.
Findings
Activation of reasoning neurons correlates with LLM performance on arithmetic tasks.
Neuron analysis explains the importance of CoT prompt components.
Proposes an automatic method to identify reasoning neurons.
Abstract
Large language models (LLMs) have shown strong arithmetic reasoning capabilities when prompted with Chain-of-Thought (CoT) prompts. However, we have only a limited understanding of how they are processed by LLMs. To demystify it, prior work has primarily focused on ablating different components in the CoT prompt and empirically observing their resulting LLM performance change. Yet, the reason why these components are important to LLM reasoning is not explored. To fill this gap, in this work, we investigate ``neuron activation'' as a lens to provide a unified explanation to observations made by prior work. Specifically, we look into neurons within the feed-forward layers of LLMs that may have activated their arithmetic reasoning capabilities, using Llama2 as an example. To facilitate this investigation, we also propose an approach based on GPT-4 to automatically identify neurons that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Advanced Text Analysis Techniques · Cognitive Science and Education Research
MethodsResidual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Adam · Attention Is All You Need · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer
