Learning to Branch for Multi-Task Learning

Pengsheng Guo; Chen-Yu Lee; Daniel Ulbricht

arXiv:2006.01895·cs.LG·June 11, 2020·42 cites

Learning to Branch for Multi-Task Learning

Pengsheng Guo, Chen-Yu Lee, Daniel Ulbricht

PDF

Open Access 1 Video

TL;DR

This paper introduces an automated, end-to-end trainable method for learning optimal branching structures in multi-task neural networks, improving task-specific performance and reducing negative transfer.

Contribution

It proposes a novel differentiable tree-structured design space using gumbel-softmax sampling to learn where to share or branch within a network for multiple tasks.

Findings

01

Effective network topologies learned on synthetic, CelebA, and Taskonomy datasets.

02

Outperforms prior methods relying on manual or heuristic branching strategies.

03

Demonstrates improved multi-task learning performance and reduced negative transfer.

Abstract

Training multiple tasks jointly in one deep network yields reduced latency during inference and better performance over the single-task counterpart by sharing certain layers of a network. However, over-sharing a network could erroneously enforce over-generalization, causing negative knowledge transfer across tasks. Prior works rely on human intuition or pre-computed task relatedness scores for ad hoc branching structures. They provide sub-optimal end results and often require huge efforts for the trial-and-error process. In this work, we present an automated multi-task learning algorithm that learns where to share or branch within a network, designing an effective network topology that is directly optimized for multiple objectives across tasks. Specifically, we propose a novel tree-structured design space that casts a tree branching operation as a gumbel-softmax sampling procedure. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Learning to Branch for Multi-Task Learning· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Machine Learning and Data Classification