When is multitask learning effective? Semantic sequence prediction under   varying data conditions

H\'ector Mart\'inez Alonso; Barbara Plank

arXiv:1612.02251·cs.CL·January 11, 2017·5 cites

When is multitask learning effective? Semantic sequence prediction under varying data conditions

H\'ector Mart\'inez Alonso, Barbara Plank

PDF

Open Access

TL;DR

This paper investigates the conditions under which multitask learning improves semantic sequence prediction, highlighting that its effectiveness depends on data characteristics and auxiliary task properties.

Contribution

It introduces a comprehensive evaluation of MTL for semantic tasks, including a novel auxiliary task setup, and identifies data-dependent factors influencing success.

Findings

01

MTL improves only 1 of 5 semantic tasks significantly.

02

Auxiliary tasks with compact, uniform label distributions are more effective.

03

Effectiveness of MTL varies based on data and task characteristics.

Abstract

Multitask learning has been applied successfully to a range of tasks, mostly morphosyntactic. However, little is known on when MTL works and whether there are data characteristics that help to determine its success. In this paper we evaluate a range of semantic sequence labeling tasks in a MTL setup. We examine different auxiliary tasks, amongst which a novel setup, and correlate their impact to data-dependent conditions. Our results show that MTL is not always effective, significant improvements are obtained only for 1 out of 5 tasks. When successful, auxiliary tasks with compact and more uniform label distributions are preferable.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Machine Learning and Data Classification