More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed Routing
Sagi Shaier, Francisco Pereira, Katharina von der Wense, Lawrence E, Hunter, Matt Jones

TL;DR
COMET introduces a biologically-inspired fixed routing mechanism in sparse neural networks, creating overlapping experts that improve learning speed and generalization across diverse tasks.
Contribution
It proposes a novel fixed, biologically-inspired random projection gating method for sparse experts, avoiding issues of trainable gating and disjoint experts.
Findings
Faster learning per update step.
Improved out-of-sample generalization.
Effective across multiple tasks and architectures.
Abstract
The evolution of biological neural systems has led to both modularity and sparse coding, which enables energy efficiency and robustness across the diversity of tasks in the lifespan. In contrast, standard neural networks rely on dense, non-specialized architectures, where all model parameters are simultaneously updated to learn multiple tasks, leading to interference. Current sparse neural network approaches aim to alleviate this issue but are hindered by limitations such as 1) trainable gating functions that cause representation collapse, 2) disjoint experts that result in redundant computation and slow learning, and 3) reliance on explicit input or task IDs that limit flexibility and scalability. In this paper we propose Conditionally Overlapping Mixture of ExperTs (COMET), a general deep learning method that addresses these challenges by inducing a modular, sparse architecture with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsComplex Network Analysis Techniques · Computability, Logic, AI Algorithms
