How Do Large Language Models Learn Concepts During Continual Pre-Training?
Barry Menglong Yao (1), Sha Li (2), Yunzhi Yao (3), Minqian Liu (2), Zaishuo Xia (1), Qifan Wang (4), Lifu Huang (1) ((1) UC Davis, (2) Virginia Tech, (3) UCLA, (4) Meta AI)

TL;DR
This paper investigates how large language models acquire, retain, and forget concepts during continual pretraining, revealing circuit-level dynamics and interactions that influence learning and interference.
Contribution
It introduces the concept circuits framework and analyzes their structure and behavior, providing new insights into concept learning in LLMs during continual training.
Findings
Concept circuits correlate with concept learning and forgetting.
Concepts show stage-wise changes during training.
Semantic similarity increases interference among concepts.
Abstract
Human beings primarily understand the world through concepts (e.g., dog), abstract mental representations that structure perception, reasoning, and learning. However, how large language models (LLMs) acquire, retain, and forget such concepts during continual pretraining remains poorly understood. In this work, we study how individual concepts are acquired and forgotten, as well as how multiple concepts interact through interference and synergy. We link these behavioral dynamics to LLMs' internal Concept Circuits, computational subgraphs associated with specific concepts, and incorporate Graph Metrics to characterize circuit structure. Our analysis reveals: (1) LLMs concept circuits provide a non-trivial, statistically significant signal of concept learning and forgetting; (2) Concept circuits exhibit a stage-wise temporal pattern during continual pretraining, with an early increase…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsChild and Animal Learning Development · Action Observation and Synchronization · Embodied and Extended Cognition
