MoE Pathfinder: Trajectory-driven Expert Pruning

Xican Yang; Yuanhe Tian; Yan Song

arXiv:2512.18425·cs.LG·December 23, 2025

MoE Pathfinder: Trajectory-driven Expert Pruning

Xican Yang, Yuanhe Tian, Yan Song

PDF

Open Access

TL;DR

This paper introduces a trajectory-based expert pruning method for Mixture-of-Experts models, improving efficiency by globally optimizing expert retention across layers using multiple importance signals.

Contribution

It presents a novel global path planning approach for expert pruning in MoE models, leveraging trajectory analysis and multiple importance signals for better pruning decisions.

Findings

01

Achieves superior pruning performance on various tasks.

02

Outperforms existing expert pruning methods.

03

Reduces computational overhead effectively.

Abstract

Mixture-of-experts (MoE) architectures used in large language models (LLMs) achieve state-of-the-art performance across diverse tasks yet face practical challenges such as deployment complexity and low activation efficiency. Expert pruning has thus emerged as a promising solution to reduce computational overhead and simplify the deployment of MoE models. However, existing expert pruning approaches conventionally rely on local importance metrics and often apply uniform layer-wise pruning, leveraging only partial evaluation signals and overlooking the heterogeneous contributions of experts across layers. To address these limitations, we propose an expert pruning approach based on the trajectory of activated experts across layers, which treats MoE as a weighted computation graph and casts expert selection as a global optimal path planning problem. Within this framework, we integrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOpportunistic and Delay-Tolerant Networks · Human Mobility and Location-Based Analysis · Mobile Crowdsensing and Crowdsourcing