More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed Routing

Sagi Shaier; Francisco Pereira; Katharina von der Wense; Lawrence E. Hunter; Matt Jones

arXiv:2410.08003·cs.LG·February 13, 2025

TL;DR

COMET introduces a biologically-inspired fixed routing mechanism in sparse neural networks, creating overlapping experts that improve learning speed and generalization across diverse tasks.

Contribution

The paper proposes a fixed, biologically inspired random-projection gating method for sparse experts, avoiding the problems associated with trainable gating and disjoint experts.
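
The gating idea described above can be illustrated with a minimal sketch. It assumes that routing is a top-k selection over a fixed (untrained) random projection of the input. This page does not spell out the mechanism, so that choice is an assumption, and every name used here (`make_fixed_projection`, `topk_mask`, `gated_layer`, `V`, `W`, `k`) is hypothetical and not taken from the official repository.

```python
import random


def make_fixed_projection(in_dim, out_dim, seed=0):
    """Draw a Gaussian matrix once from a seeded generator (in_dim x out_dim)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(out_dim)] for _ in range(in_dim)]


def matvec(x, M):
    """Multiply a vector x (length in_dim) by a matrix M (in_dim x out_dim)."""
    return [sum(x[i] * M[i][j] for i in range(len(x))) for j in range(len(M[0]))]


def topk_mask(scores, k):
    """Binary mask that keeps the k largest scores (k-winner-take-all)."""
    winners = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    mask = [0] * len(scores)
    for i in winners:
        mask[i] = 1
    return mask


def gated_layer(x, W, V, k):
    """ReLU layer whose hidden units are gated by an input-dependent mask.

    V is the fixed routing projection (assumed never updated by training).
    W stands in for the layer's ordinary weights.
    """
    mask = topk_mask(matvec(x, V), k)
    pre_activation = matvec(x, W)
    hidden = [max(0.0, p) * m for p, m in zip(pre_activation, mask)]
    return hidden, mask


in_dim, hidden_dim, k = 4, 8, 3
V = make_fixed_projection(in_dim, hidden_dim, seed=0)  # fixed routing matrix
W = make_fixed_projection(in_dim, hidden_dim, seed=1)  # placeholder for trainable weights

x_a = [1.0, 0.0, 0.5, -0.2]
x_b = [1.0, 0.1, 0.5, -0.2]  # a small perturbation of x_a

h_a, mask_a = gated_layer(x_a, W, V, k)
h_b, mask_b = gated_layer(x_b, W, V, k)

# Number of hidden units active for both inputs, i.e. how much the two
# input-selected "experts" overlap.
overlap = sum(a & b for a, b in zip(mask_a, mask_b))
print("mask for x_a:", mask_a)
print("mask for x_b:", mask_b)
print("shared active units:", overlap)
```

In this sketch each input activates exactly `k` of the hidden units, and the active subset plays the role of an expert. Because `V` is fixed, the mask is a deterministic function of the input and needs no task ID, and two inputs may share some of their active units, so the resulting experts can overlap instead of being disjoint.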

Findings

01. Faster learning per update step.

02. Improved out-of-sample generalization.

03. Effective across multiple tasks and architectures.

Abstract

The evolution of biological neural systems has led to both modularity and sparse coding, which enables energy efficiency and robustness across the diversity of tasks in the lifespan. In contrast, standard neural networks rely on dense, non-specialized architectures, where all model parameters are simultaneously updated to learn multiple tasks, leading to interference. Current sparse neural network approaches aim to alleviate this issue but are hindered by limitations such as 1) trainable gating functions that cause representation collapse, 2) disjoint experts that result in redundant computation and slow learning, and 3) reliance on explicit input or task IDs that limit flexibility and scalability. In this paper we propose Conditionally Overlapping Mixture of ExperTs (COMET), a general deep learning method that addresses these challenges by inducing a modular, sparse architecture with…

Code & Models

Repositories

shaier/comet
PyTorch · Official

Videos

More Experts Than Galaxies: Conditionally-Overlapping Experts with Biologically-Inspired Fixed Routing · SlidesLive

Taxonomy

Topics: Complex Network Analysis Techniques · Computability, Logic, AI Algorithms