MIMONets: Multiple-Input-Multiple-Output Neural Networks Exploiting   Computation in Superposition

Nicolas Menet (1; 2); Michael Hersche (1; 2); Geethan; Karunaratne (1); Luca Benini (2); Abu Sebastian (1); Abbas Rahimi (1) ((1); IBM Research - Zurich; (2) ETH Zurich)

arXiv:2312.02829·cs.LG·December 6, 2023·6 cites

MIMONets: Multiple-Input-Multiple-Output Neural Networks Exploiting Computation in Superposition

Nicolas Menet (1, 2), Michael Hersche (1, 2), Geethan, Karunaratne (1), Luca Benini (2), Abu Sebastian (1), Abbas Rahimi (1) ((1), IBM Research - Zurich, (2) ETH Zurich)

PDF

Open Access 1 Repo 1 Video

TL;DR

MIMONets are neural networks designed to process multiple inputs simultaneously in superposition, reducing inference costs while maintaining accuracy, and allowing dynamic trade-offs between speed and precision.

Contribution

This paper introduces MIMONets, a novel neural network architecture that handles multiple inputs in superposition with variable binding, enabling faster inference and flexible accuracy-speed trade-offs.

Findings

01

Achieves 2-4x speedup with minimal accuracy loss on CIFAR datasets.

02

Handles 2-4 inputs simultaneously with high accuracy on long-range arena benchmark.

03

Provides mathematical bounds on interference in superposition channels.

Abstract

With the advent of deep learning, progressively larger neural networks have been designed to solve complex tasks. We take advantage of these capacity-rich models to lower the cost of inference by exploiting computation in superposition. To reduce the computational burden per input, we propose Multiple-Input-Multiple-Output Neural Networks (MIMONets) capable of handling many inputs at once. MIMONets augment various deep neural network architectures with variable binding mechanisms to represent an arbitrary number of inputs in a compositional data structure via fixed-width distributed representations. Accordingly, MIMONets adapt nonlinear neural transformations to process the data structure holistically, leading to a speedup nearly proportional to the number of superposed input items in the data structure. After processing in superposition, an unbinding mechanism recovers each transformed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ibm/multiple-input-multiple-output-nets
pytorchOfficial

Videos

MIMONets: Multiple-Input-Multiple-Output Neural Networks Exploiting Computation in Superposition· slideslive

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning

MethodsSparse Evolutionary Training · *Communicated@Fast*How Do I Communicate to Expedia? · Multi-Head Attention · Linear Layer · Attention Is All You Need · Absolute Position Encodings · Convolution · Batch Normalization · Dropout · Dense Connections