Deconstructing Pre-training: Knowledge Attribution Analysis in MoE and Dense Models

Bo Wang; Junzhuo Li; Hong Chen; Yuanlin Chu; Yuxuan Fan; Xuming Hu

arXiv:2601.08383·cs.AI·January 14, 2026

Deconstructing Pre-training: Knowledge Attribution Analysis in MoE and Dense Models

Bo Wang, Junzhuo Li, Hong Chen, Yuanlin Chu, Yuxuan Fan, Xuming Hu

PDF

Open Access 1 Video

TL;DR

This paper introduces a neuron-level attribution metric to compare knowledge acquisition in MoE and dense models, revealing that MoE's sparsity leads to early, stable, and distributed knowledge storage during pre-training.

Contribution

It presents Gated-LPI, a novel neuron attribution method, and provides the first detailed comparison of knowledge dynamics in MoE versus dense architectures during training.

Findings

01

MoE neurons form a high-utility core capturing most positive updates

02

MoE models stabilize importance profiles early in training

03

Sparsity in MoE leads to distributed and robust knowledge storage

Abstract

Mixture-of-Experts (MoE) architectures decouple model capacity from per-token computation, enabling scaling beyond the computational limits imposed by dense scaling laws. Yet how MoE architectures shape knowledge acquisition during pre-training, and how this process differs from dense architectures, remains unknown. To address this issue, we introduce Gated-LPI (Log-Probability Increase), a neuron-level attribution metric that decomposes log-probability increase across neurons. We present a time-resolved comparison of knowledge acquisition dynamics in MoE and dense architectures, tracking checkpoints over 1.2M training steps (~ 5.0T tokens) and 600K training steps (~ 2.5T tokens), respectively. Our experiments uncover three patterns: (1) Low-entropy backbone. The top approximately 1% of MoE neurons capture over 45% of positive updates, forming a high-utility core, which is absent in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Deconstructing Pre-training: Knowledge Attribution Analysis in MoE and Dense Models· underline

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Explainable Artificial Intelligence (XAI)