An Investigation of Neuron Activation as a Unified Lens to Explain   Chain-of-Thought Eliciting Arithmetic Reasoning of LLMs

Daking Rai; Ziyu Yao

arXiv:2406.12288·cs.AI·September 4, 2024

An Investigation of Neuron Activation as a Unified Lens to Explain Chain-of-Thought Eliciting Arithmetic Reasoning of LLMs

Daking Rai, Ziyu Yao

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper explores how neuron activation in large language models explains their arithmetic reasoning abilities with Chain-of-Thought prompts, offering a unified perspective on prior empirical observations.

Contribution

It introduces a neuron activation-based approach to explain LLM reasoning, specifically identifying reasoning neurons in feed-forward layers using GPT-4.

Findings

01

Activation of reasoning neurons correlates with LLM performance on arithmetic tasks.

02

Neuron analysis explains the importance of CoT prompt components.

03

Proposes an automatic method to identify reasoning neurons.

Abstract

Large language models (LLMs) have shown strong arithmetic reasoning capabilities when prompted with Chain-of-Thought (CoT) prompts. However, we have only a limited understanding of how they are processed by LLMs. To demystify it, prior work has primarily focused on ablating different components in the CoT prompt and empirically observing their resulting LLM performance change. Yet, the reason why these components are important to LLM reasoning is not explored. To fill this gap, in this work, we investigate ``neuron activation'' as a lens to provide a unified explanation to observations made by prior work. Specifically, we look into neurons within the feed-forward layers of LLMs that may have activated their arithmetic reasoning capabilities, using Llama2 as an example. To facilitate this investigation, we also propose an approach based on GPT-4 to automatically identify neurons that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

An Investigation of Neuron Activation as a Unified Lens to Explain Chain-of-Thought Eliciting Arithmetic Reasoning of LLMs· underline

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Advanced Text Analysis Techniques · Cognitive Science and Education Research

MethodsResidual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Adam · Attention Is All You Need · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer