Efficient Controllable Multi-Task Architectures
Abhishek Aich, Samuel Schulter, Amit K. Roy-Chowdhury, Manmohan, Chandraker, Yumin Suh

TL;DR
This paper introduces a flexible multi-task model with slimmable encoder and decoder components, enabling post-deployment adjustment of task importance and computational budget without retraining, using a novel training and search strategy.
Contribution
It proposes a multi-task architecture with slimmable channels and a configuration-invariant knowledge distillation loss, allowing dynamic control over task performance and computational cost after deployment.
Findings
Achieves higher controllability (~33.5%) over prior methods on NYUD-v2.
Enables adjustment of task importance and compute budget post-deployment.
Demonstrates effectiveness on multiple benchmarks with diverse architectures.
Abstract
We aim to train a multi-task model such that users can adjust the desired compute budget and relative importance of task performances after deployment, without retraining. This enables optimizing performance for dynamically varying user needs, without heavy computational overhead to train and save models for various scenarios. To this end, we propose a multi-task model consisting of a shared encoder and task-specific decoders where both encoder and decoder channel widths are slimmable. Our key idea is to control the task importance by varying the capacities of task-specific decoders, while controlling the total computational cost by jointly adjusting the encoder capacity. This improves overall accuracy by allowing a stronger encoder for a given budget, increases control over computational cost, and delivers high-quality slimmed sub-architectures based on user's constraints. Our training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Efficient Controllable Multi-Task Architectures· youtube
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
