MixMo: Mixing Multiple Inputs for Multiple Outputs via Deep Subnetworks
Alexandre Rame, Remy Sun, Matthieu Cord

TL;DR
MixMo introduces a novel framework for multi-input multi-output deep subnetworks that employs feature mixing techniques like CutMix to improve classification performance without additional inference costs.
Contribution
The paper proposes a new mixing mechanism for subnetworks in deep learning, replacing summing with feature mixing inspired by data augmentation, leading to improved accuracy.
Findings
Achieves state-of-the-art results on CIFAR-100 and Tiny ImageNet.
Outperforms data-augmented deep ensembles in accuracy.
Operates efficiently without extra inference or memory overhead.
Abstract
Recent strategies achieved ensembling "for free" by fitting concurrently diverse subnetworks inside a single base network. The main idea during training is that each subnetwork learns to classify only one of the multiple inputs simultaneously provided. However, the question of how to best mix these multiple inputs has not been studied so far. In this paper, we introduce MixMo, a new generalized framework for learning multi-input multi-output deep subnetworks. Our key motivation is to replace the suboptimal summing operation hidden in previous approaches by a more appropriate mixing mechanism. For that purpose, we draw inspiration from successful mixed sample data augmentations. We show that binary mixing in features - particularly with rectangular patches from CutMix - enhances results by making subnetworks stronger and more diverse. We improve state of the art for image classification…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
MethodsCutMix
