Grassmannian Mixture-of-Experts: Concentration-Controlled Routing on Subspace Manifolds
Ibne Farabi Shihab, Sanjeda Akter, Anuj Sharma

TL;DR
This paper introduces Grassmannian MoE, a novel routing framework on subspace manifolds that offers a continuous, interpretable control over sparsity and routing entropy, improving load balancing and interpretability in mixture-of-experts models.
Contribution
We propose a new Grassmannian MoE framework using concentration parameters of Matrix Bingham distributions for principled, smooth sparsity control and uncertainty-aware expert routing.
Findings
Achieves 0% routing collapse across all seeds.
Improves load balance by 15-30%.
Provides interpretable, heterogeneous expert specialization.
Abstract
Mixture-of-Experts models rely on learned routers to assign tokens to experts, yet standard softmax gating provides no principled mechanism to control the tradeoff between sparsity and utilization. We propose Grassmannian MoE (GrMoE), a routing framework that operates on the Grassmannian manifold of subspaces, where gating weights arise from the concentration parameters of Matrix Bingham distributions. This construction yields a single, interpretable knob -- the concentration matrix -- that continuously controls routing entropy, replacing discrete top- selection with a smooth, geometrically principled sparsity mechanism. We further develop an amortized variational inference procedure for posterior routing distributions, enabling uncertainty-aware expert assignment that naturally resists expert collapse. We formally prove tight bounds relating the Bingham concentration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Stochastic Gradient Optimization Techniques · Advanced Graph Neural Networks
