Efficient Controllable Multi-Task Architectures

Abhishek Aich; Samuel Schulter; Amit K. Roy-Chowdhury; Manmohan; Chandraker; Yumin Suh

arXiv:2308.11744·cs.CV·August 24, 2023

Efficient Controllable Multi-Task Architectures

Abhishek Aich, Samuel Schulter, Amit K. Roy-Chowdhury, Manmohan, Chandraker, Yumin Suh

PDF

Open Access 1 Video

TL;DR

This paper introduces a flexible multi-task model with slimmable encoder and decoder components, enabling post-deployment adjustment of task importance and computational budget without retraining, using a novel training and search strategy.

Contribution

It proposes a multi-task architecture with slimmable channels and a configuration-invariant knowledge distillation loss, allowing dynamic control over task performance and computational cost after deployment.

Findings

01

Achieves higher controllability (~33.5%) over prior methods on NYUD-v2.

02

Enables adjustment of task importance and compute budget post-deployment.

03

Demonstrates effectiveness on multiple benchmarks with diverse architectures.

Abstract

We aim to train a multi-task model such that users can adjust the desired compute budget and relative importance of task performances after deployment, without retraining. This enables optimizing performance for dynamically varying user needs, without heavy computational overhead to train and save models for various scenarios. To this end, we propose a multi-task model consisting of a shared encoder and task-specific decoders where both encoder and decoder channel widths are slimmable. Our key idea is to control the task importance by varying the capacities of task-specific decoders, while controlling the total computational cost by jointly adjusting the encoder capacity. This improves overall accuracy by allowing a stronger encoder for a given budget, increases control over computational cost, and delivers high-quality slimmed sub-architectures based on user's constraints. Our training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Efficient Controllable Multi-Task Architectures· youtube

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning