Empirical Evaluation of Multi-task Learning in Deep Neural Networks for   Natural Language Processing

Jianquan Li; Xiaokang Liu; Wenpeng Yin; Min Yang; Liqun Ma; Yaohong; Jin

arXiv:1908.07820·cs.CL·August 10, 2020·1 cites

Empirical Evaluation of Multi-task Learning in Deep Neural Networks for Natural Language Processing

Jianquan Li, Xiaokang Liu, Wenpeng Yin, Min Yang, Liqun Ma, Yaohong, Jin

PDF

Open Access

TL;DR

This paper systematically evaluates various multi-task learning architectures and mechanisms in NLP, aiming to understand their strengths and weaknesses and to develop improved hybrid models.

Contribution

It provides a comprehensive comparison of existing MTL methods in NLP and proposes new hybrid architectures to enhance performance.

Findings

01

Thorough comparison of MTL architectures across NLP tasks

02

Identification of strengths and weaknesses of existing methods

03

Development of hybrid models combining best features

Abstract

Multi-Task Learning (MTL) aims at boosting the overall performance of each individual task by leveraging useful information contained in multiple related tasks. It has shown great success in natural language processing (NLP). Currently, a number of MLT architectures and learning mechanisms have been proposed for various NLP tasks. However, there is no systematic exploration and comparison of different MLT architectures and learning mechanisms for their strong performance in-depth. In this paper, we conduct a thorough examination of typical MTL methods on a broad range of representative NLP tasks. Our primary goal is to understand the merits and demerits of existing MTL methods in NLP tasks, thus devising new hybrid architectures intended to combine their strengths.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications