Sparsity is Combinatorial Depth: Quantifying MoE Expressivity via Tropical Geometry
Ye Su, Huayi Tang, Zixuan Gong, Yong Liu

TL;DR
This paper introduces a geometric framework using tropical geometry to analyze the expressivity of Mixture-of-Experts architectures, revealing how sparsity enhances capacity and resilience.
Contribution
It establishes a novel algebraic isomorphism between MoE routing and tropical polynomials, providing geometric insights into capacity and architectural design.
Findings
MoE expressivity scales with binomial coefficients due to sparsity.
Dense networks suffer capacity collapse on low-dimensional data.
Shared experts are necessary to prevent routing collapse.
Abstract
While Mixture-of-Experts (MoE) architectures define the state-of-the-art, their theoretical success is often attributed to heuristic efficiency rather than geometric expressivity. In this work, we present the first analysis of MoE through the lens of tropical geometry, establishing that the Top- routing mechanism is algebraically isomorphic to the -th elementary symmetric tropical polynomial. This isomorphism partitions the input space into the Normal Fan of a Hypersimplex, revealing that \textbf{sparsity is combinatorial depth} which scales geometric capacity by the binomial coefficient . Moving beyond ambient bounds, we introduce the concept of \textit{Effective Capacity} under the Manifold Hypothesis. We prove that while dense networks suffer from capacity collapse on low-dimensional data, MoE architectures exhibit \textit{Combinatorial Resilience}, maintaining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
