Trace Norm Regularised Deep Multi-Task Learning

Yongxin Yang; Timothy M. Hospedales

arXiv:1606.04038 · cs.LG · February 20, 2017 · 63 cites

TL;DR

This paper introduces a deep multi-task learning framework that regularises the parameters of several neural networks with the tensor trace norm, so that the parameter sharing strategy is learned from data rather than fixed in advance.

Contribution

It presents a data-driven approach to learn parameter sharing in multi-task learning without predefining sharing strategies, using tensor trace norm regularization.

Findings

01 Effective automatic sharing learned across models

02 Improved multi-task learning performance

03 Flexible sharing strategy adapts to data

Abstract

We propose a framework for training multiple neural networks simultaneously. The parameters from all models are regularised by the tensor trace norm, so that each neural network is encouraged to reuse others' parameters if possible -- this is the main motivation behind multi-task learning. In contrast to many deep multi-task learning models, we do not predefine a parameter sharing strategy by specifying which layers have tied parameters. Instead, our framework considers sharing for all shareable layers, and the sharing strategy is learned in a data-driven way.
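The regulariser described in the abstract can be illustrated with a small NumPy sketch. The abstract does not say which definition of the tensor trace norm is used, so this example assumes one common choice: the sum of the nuclear norms of all mode unfoldings of a tensor built by stacking the same layer's weight matrix from every task. The helper names `unfold` and `tensor_trace_norm`, the layer sizes, and the random weights are all hypothetical and are not taken from the paper or its repository.

```python
import numpy as np


def unfold(tensor, mode):
    """Mode-`mode` unfolding: move that axis to the front, flatten the rest."""
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)


def tensor_trace_norm(tensor):
    """Sum of nuclear norms over all mode unfoldings (an assumed variant)."""
    return sum(
        np.linalg.norm(unfold(tensor, mode), ord="nuc")
        for mode in range(tensor.ndim)
    )


rng = np.random.default_rng(0)
num_tasks, d_in, d_out = 4, 6, 5  # hypothetical sizes for one shareable layer

# Case 1: every task uses the same weight matrix (full sharing).
shared = rng.standard_normal((d_in, d_out))
tied = np.stack([shared] * num_tasks, axis=-1)  # shape (d_in, d_out, num_tasks)

# Case 2: every task has its own independent weights, rescaled so that both
# tensors have the same overall magnitude and the comparison is fair.
independent = rng.standard_normal((d_in, d_out, num_tasks))
independent *= np.linalg.norm(tied) / np.linalg.norm(independent)

penalty_tied = tensor_trace_norm(tied)
penalty_independent = tensor_trace_norm(independent)
print("tied:", penalty_tied, "independent:", penalty_independent)
```

In this sketch the tied tensor receives the smaller penalty because its task-mode unfolding has rank one. That is the sense in which adding such a term to the training loss would encourage each network to reuse the others' parameters, while still allowing task-specific weights where the data call for them.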

Code & Models

Repositories

wOOL/TNRDMTL (tf, Official)

Taxonomy

Topics: Tensor decomposition and applications · Domain Adaptation and Few-Shot Learning · Model Reduction and Neural Networks