TL;DR
This paper introduces an automated, differentiable neural architecture search method to design resource-efficient, branched multi-task neural networks for vision problems, outperforming manual designs under resource constraints.
Contribution
It presents a novel differentiable search technique for automatically creating multi-task network architectures with resource-aware optimization.
Findings
Consistently finds high-performing structures within resource limits.
Outperforms manually designed architectures on dense prediction tasks.
Effective across various multi-modal vision tasks.
Abstract
The multi-modal nature of many vision problems calls for neural network architectures that can perform multiple tasks concurrently. Typically, such architectures have been handcrafted in the literature. However, given the size and complexity of the problem, this manual architecture exploration likely exceeds human design abilities. In this paper, we propose a principled approach, rooted in differentiable neural architecture search, to automatically define branching (tree-like) structures in the encoding stage of a multi-task neural network. To allow flexibility within resource-constrained environments, we introduce a proxyless, resource-aware loss that dynamically controls the model size. Evaluations across a variety of dense prediction tasks show that our approach consistently finds high-performing branching structures within limited resource budgets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
