MIMONets: Multiple-Input-Multiple-Output Neural Networks Exploiting Computation in Superposition
Nicolas Menet (1, 2), Michael Hersche (1, 2), Geethan, Karunaratne (1), Luca Benini (2), Abu Sebastian (1), Abbas Rahimi (1) ((1), IBM Research - Zurich, (2) ETH Zurich)

TL;DR
MIMONets are neural networks designed to process multiple inputs simultaneously in superposition, reducing inference costs while maintaining accuracy, and allowing dynamic trade-offs between speed and precision.
Contribution
This paper introduces MIMONets, a novel neural network architecture that handles multiple inputs in superposition with variable binding, enabling faster inference and flexible accuracy-speed trade-offs.
Findings
Achieves 2-4x speedup with minimal accuracy loss on CIFAR datasets.
Handles 2-4 inputs simultaneously with high accuracy on long-range arena benchmark.
Provides mathematical bounds on interference in superposition channels.
Abstract
With the advent of deep learning, progressively larger neural networks have been designed to solve complex tasks. We take advantage of these capacity-rich models to lower the cost of inference by exploiting computation in superposition. To reduce the computational burden per input, we propose Multiple-Input-Multiple-Output Neural Networks (MIMONets) capable of handling many inputs at once. MIMONets augment various deep neural network architectures with variable binding mechanisms to represent an arbitrary number of inputs in a compositional data structure via fixed-width distributed representations. Accordingly, MIMONets adapt nonlinear neural transformations to process the data structure holistically, leading to a speedup nearly proportional to the number of superposed input items in the data structure. After processing in superposition, an unbinding mechanism recovers each transformed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
MethodsSparse Evolutionary Training · *Communicated@Fast*How Do I Communicate to Expedia? · Multi-Head Attention · Linear Layer · Attention Is All You Need · Absolute Position Encodings · Convolution · Batch Normalization · Dropout · Dense Connections
