Neuron-Level Differentiation of Memorization and Generalization in Large Language Models

Ko-Wei Huang; Yi-Fu Fu; Ching-Yu Tsai; Yu-Chieh Tu; Tzu-Ling Cheng; Cheng-Yu Lin; Yi-Ting Yang; Heng-Yi Liu; Keng-Te Liao; Da-Cheng Juan; Shou-De Lin

arXiv:2412.18497·cs.CL·July 10, 2025

Neuron-Level Differentiation of Memorization and Generalization in Large Language Models

Ko-Wei Huang, Yi-Fu Fu, Ching-Yu Tsai, Yu-Chieh Tu, Tzu-Ling Cheng, Cheng-Yu Lin, Yi-Ting Yang, Heng-Yi Liu, Keng-Te Liao, Da-Cheng Juan, Shou-De Lin

PDF

Open Access

TL;DR

This paper uncovers neuron-level distinctions between memorization and generalization in large language models, demonstrating that specific neurons are responsible for each behavior and can be manipulated at inference time to steer model responses.

Contribution

It identifies neuron subsets responsible for memorization and generalization, showing their modularity and enabling behavior control through inference-time interventions.

Findings

01

Neuron subsets are responsible for memorization and generalization.

02

Inference-time interventions can steer model behavior.

03

Neuron-behavior associations are consistent across models and tasks.

Abstract

We investigate how Large Language Models (LLMs) distinguish between memorization and generalization at the neuron level. Through carefully designed tasks, we identify distinct neuron subsets responsible for each behavior. Experiments on both a GPT-2 model trained from scratch and a pretrained LLaMA-3.2 model fine-tuned with LoRA show consistent neuron-level specialization. We further demonstrate that inference-time interventions on these neurons can steer the model's behavior toward memorization or generalization. To assess robustness, we evaluate intra-task and inter-task consistency, confirming that these neuron-behavior associations reflect generalizable patterns rather than dataset-specific artifacts. Our findings reveal modular structure in LLMs and enable controlling memorization and generalization behaviors at inference time.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Artificial Intelligence in Law