MutualNet: Adaptive ConvNet via Mutual Learning from Different Model Configurations
Taojiannan Yang, Sijie Zhu, Matias Mendieta, Pu Wang, Ravikumar, Balakrishnan, Minwoo Lee, Tao Han, Mubarak Shah, Chen Chen

TL;DR
MutualNet is a versatile training approach that enables a single neural network to operate efficiently across various resource constraints by mutual learning among different configurations, improving performance and reducing training costs.
Contribution
This work introduces MutualNet, a novel mutual learning framework that trains a single network to adapt to multiple configurations, applicable to diverse architectures and tasks.
Findings
Achieves consistent improvements across various datasets and models.
Reduces training costs by avoiding multiple independent trainings.
Enhances model performance even without resource constraints.
Abstract
Most existing deep neural networks are static, which means they can only do inference at a fixed complexity. But the resource budget can vary substantially across different devices. Even on a single device, the affordable budget can change with different scenarios, and repeatedly training networks for each required budget would be incredibly expensive. Therefore, in this work, we propose a general method called MutualNet to train a single network that can run at a diverse set of resource constraints. Our method trains a cohort of model configurations with various network widths and input resolutions. This mutual learning scheme not only allows the model to run at different width-resolution configurations but also transfers the unique knowledge among these configurations, helping the model to learn stronger representations overall. MutualNet is a general training methodology that can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Residual Connection · Average Pooling · Global Average Pooling · Batch Normalization · Convolution · Max Pooling · 1x1 Convolution · Residual Block · Kaiming Initialization
