MutualNet: Adaptive ConvNet via Mutual Learning from Different Model   Configurations

Taojiannan Yang; Sijie Zhu; Matias Mendieta; Pu Wang; Ravikumar; Balakrishnan; Minwoo Lee; Tao Han; Mubarak Shah; Chen Chen

arXiv:2105.07085·cs.CV·January 3, 2022

MutualNet: Adaptive ConvNet via Mutual Learning from Different Model Configurations

Taojiannan Yang, Sijie Zhu, Matias Mendieta, Pu Wang, Ravikumar, Balakrishnan, Minwoo Lee, Tao Han, Mubarak Shah, Chen Chen

PDF

Open Access 1 Repo

TL;DR

MutualNet is a versatile training approach that enables a single neural network to operate efficiently across various resource constraints by mutual learning among different configurations, improving performance and reducing training costs.

Contribution

This work introduces MutualNet, a novel mutual learning framework that trains a single network to adapt to multiple configurations, applicable to diverse architectures and tasks.

Findings

01

Achieves consistent improvements across various datasets and models.

02

Reduces training costs by avoiding multiple independent trainings.

03

Enhances model performance even without resource constraints.

Abstract

Most existing deep neural networks are static, which means they can only do inference at a fixed complexity. But the resource budget can vary substantially across different devices. Even on a single device, the affordable budget can change with different scenarios, and repeatedly training networks for each required budget would be incredibly expensive. Therefore, in this work, we propose a general method called MutualNet to train a single network that can run at a diverse set of resource constraints. Our method trains a cohort of model configurations with various network widths and input resolutions. This mutual learning scheme not only allows the model to run at different width-resolution configurations but also transfers the unique knowledge among these configurations, helping the model to learn stronger representations overall. MutualNet is a general training methodology that can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

taoyang1122/MutualNet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Residual Connection · Average Pooling · Global Average Pooling · Batch Normalization · Convolution · Max Pooling · 1x1 Convolution · Residual Block · Kaiming Initialization