Superposition of many models into one

Brian Cheung; Alex Terekhov; Yubei Chen; Pulkit Agrawal; Bruno Olshausen

arXiv:1902.05522 · cs.LG · June 18, 2019 · 46 cites

Open Access · 1 Repo

TL;DR

This paper introduces a method for storing multiple neural network models in a single set of parameters. Each model can be retrieved and trained individually without significantly interfering with the others, making use of network capacity that would otherwise go unrealized during training.

Contribution

The authors propose a superposition technique that lets multiple models coexist in one parameter set, making use of a network's unrealized capacity during training rather than reducing its size afterward.

Findings

01

A surprisingly large number of models can be stored in a single parameter instance

02

Each model can undergo thousands of training steps without significantly interfering with the others

03

Superposition can be viewed as the online complement of compression, using unrealized capacity during training

Abstract

We present a method for storing multiple models within a single set of parameters. Models can coexist in superposition and still be retrieved individually. In experiments with neural networks, we show that a surprisingly large number of models can be effectively stored within a single parameter instance. Furthermore, each of these models can undergo thousands of training steps without significantly interfering with other models within the superposition. This approach may be viewed as the online complement of compression: rather than reducing the size of a network after training, we make use of the unrealized capacity of a network during training.
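The abstract does not spell out how models are stored or retrieved. The following is a minimal sketch of one way superposition can work, under the assumption that each model is bound to its own random ±1 "context" vector before the models are summed. That assumption, the toy linear models, and every name in the code (`random_context`, `superpose`, `retrieve`) are illustrative and are not taken from the text above or from the linked repository.

```python
import random


def random_context(dim, rng):
    """Hypothetical per-model key: a random vector of +1.0 / -1.0 entries."""
    return [rng.choice((-1.0, 1.0)) for _ in range(dim)]


def superpose(models, contexts):
    """Sum all weight vectors into one, each multiplied elementwise by its context."""
    dim = len(models[0])
    combined = [0.0] * dim
    for weights, context in zip(models, contexts):
        for i in range(dim):
            combined[i] += weights[i] * context[i]
    return combined


def retrieve(combined, context):
    """Unbind one model: since c * c == 1, its own weights come back unchanged,
    while every other model's weights stay scrambled by a random sign pattern."""
    return [w * c for w, c in zip(combined, context)]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


rng = random.Random(0)
dim, num_models = 4096, 4

# Toy "models": each one is just the weight vector of a linear readout.
models = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_models)]
contexts = [random_context(dim, rng) for _ in range(num_models)]

# A single parameter vector of length `dim` now holds all four models.
combined = superpose(models, contexts)

# Probe with an input that model 0 responds to strongly (here, its own weights).
x = models[0]
exact = dot(models[0], x)
approx = dot(retrieve(combined, contexts[0]), x)
wrong_key = dot(retrieve(combined, contexts[1]), x)
relative_error = abs(approx - exact) / abs(exact)

print(f"exact output of model 0:        {exact:.1f}")
print(f"output retrieved from the sum:  {approx:.1f}")
print(f"output using the wrong context: {wrong_key:.1f}")
```

In this sketch the interference from the other stored models behaves like zero-mean noise that partially cancels across dimensions, so the retrieved output stays close to the exact one when the input aligns with the stored model and the dimension is large relative to the number of models. Retrieving with the wrong context yields an output near zero by comparison.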

Code & Models

Repositories

briancheung/superposition
PyTorch

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Topic Modeling · Multimodal Machine Learning Applications