A Generalized Recurrent Neural Architecture for Text Classification with   Multi-Task Learning

Honglun Zhang; Liqiang Xiao; Yongkun Wang; Yaohui Jin

arXiv:1707.02892·cs.CL·July 11, 2017·6 cites

A Generalized Recurrent Neural Architecture for Text Classification with Multi-Task Learning

Honglun Zhang, Liqiang Xiao, Yongkun Wang, Yaohui Jin

PDF

Open Access

TL;DR

This paper introduces a flexible multi-task learning architecture with recurrent neural layers that models complex interactions among multiple text classification tasks, leading to significant performance improvements.

Contribution

It proposes a generalized recurrent neural architecture for multi-task learning that captures complex task interactions, surpassing previous simpler models.

Findings

01

Significant performance improvements on five benchmark datasets.

02

Effective modeling of complex correlations among three or more tasks.

03

Flexible architecture adaptable to various multi-task scenarios.

Abstract

Multi-task learning leverages potential correlations among related tasks to extract common features and yield performance gains. However, most previous works only consider simple or weak interactions, thereby failing to model complex correlations among three or more tasks. In this paper, we propose a multi-task learning architecture with four types of recurrent neural layers to fuse information across multiple related tasks. The architecture is structurally flexible and considers various interactions among tasks, which can be regarded as a generalized case of many previous works. Extensive experiments on five benchmark datasets for text classification show that our model can significantly improve performances of related tasks with additional information from others.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text and Document Classification Technologies · Advanced Text Analysis Techniques