Boosting Share Routing for Multi-task Learning

Xiaokai Chen; Xiaoguang Gu; Libo Fu

arXiv:2009.00387·cs.LG·March 2, 2021

Boosting Share Routing for Multi-task Learning

Xiaokai Chen, Xiaoguang Gu, Libo Fu

PDF

TL;DR

This paper introduces MTNAS, a neural architecture search framework that optimizes task sharing routes in multi-task learning, leading to improved performance and reduced negative transfer across recommendation datasets.

Contribution

The paper proposes a flexible, dynamic sharing route search space for MTL, enabling automatic discovery of effective task sharing architectures beyond traditional parameter sharing methods.

Findings

01

MTNAS outperforms single-task and typical MTL models on recommendation datasets.

02

It learns sparse sharing routes that mitigate negative transfer.

03

The approach maintains high computational efficiency.

Abstract

Multi-task learning (MTL) aims to make full use of the knowledge contained in multi-task supervision signals to improve the overall performance. How to make the knowledge of multiple tasks shared appropriately is an open problem for MTL. Most existing deep MTL models are based on parameter sharing. However, suitable sharing mechanism is hard to design as the relationship among tasks is complicated. In this paper, we propose a general framework called Multi-Task Neural Architecture Search (MTNAS) to efficiently find a suitable sharing route for a given MTL problem. MTNAS modularizes the sharing part into multiple layers of sub-networks. It allows sparse connection among these sub-networks and soft sharing based on gating is enabled for a certain route. Benefiting from such setting, each candidate architecture in our search space defines a dynamic sparse sharing route which is more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.