Network Clustering for Multi-task Learning

Dehong Gao; Wenjing Yang; Huiling Zhou; Yi Wei; Yi Hu; Hao Wang

arXiv:2101.09018 · cs.IR · January 25, 2021 · 1 citation

Open Access

TL;DR

This paper introduces a novel cluster layer for multi-task learning that groups related tasks to improve the transition from general to specific representations, enhancing model efficiency.

Contribution

The paper proposes a new cluster layer mechanism that dynamically groups tasks during training to better facilitate the transition from general to task-specific features in MTL.

Findings

01 Cluster layer improves MTL performance on document classification

02 Model efficiently transitions from general to specific representations

03 Experimental results validate the effectiveness of the clustering approach

Abstract

Multi-Task Learning (MTL) has been widely studied by researchers worldwide. The majority of current MTL studies adopt the hard parameter sharing structure, where hard layers tend to learn general representations over all tasks and specific layers tend to learn specific representations for each task. Since the specific layers directly follow the hard layers, the MTL model must also estimate this direct change (from general to specific). To alleviate this problem, we introduce a novel cluster layer, which groups tasks into clusters during training. In a cluster layer, the tasks in the same cluster are further required to share the same network. In this way, the cluster layer produces a general representation within a cluster while producing relatively specific representations for different clusters. As transitions the cluster layers are used…
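The layer ordering the abstract describes (hard shared layers, then a cluster layer, then task-specific layers) can be illustrated with a minimal NumPy forward pass. This is only a sketch under stated assumptions, not the authors' implementation: the class `ClusteredMTL`, the helper functions, the layer sizes, and the task names are all hypothetical, and the task-to-cluster assignment is fixed by hand here, whereas the paper groups tasks during training by a procedure this excerpt does not spell out.

```python
import numpy as np

rng = np.random.default_rng(0)

def dense(in_dim, out_dim):
    """Return randomly initialised (weights, bias) for one linear layer."""
    return rng.normal(scale=0.1, size=(in_dim, out_dim)), np.zeros(out_dim)

def forward(layer, x):
    """Apply a linear map followed by ReLU."""
    w, b = layer
    return np.maximum(x @ w + b, 0.0)

class ClusteredMTL:
    """Hypothetical model: shared layer -> per-cluster layer -> per-task head."""

    def __init__(self, in_dim, hidden, n_classes, task_to_cluster):
        # Assumption: the assignment is given up front instead of being learned.
        self.task_to_cluster = dict(task_to_cluster)
        # Hard layer: one set of parameters used by every task.
        self.shared = dense(in_dim, hidden)
        # Cluster layer: one set of parameters per cluster, so tasks in the
        # same cluster pass through the same network.
        clusters = set(self.task_to_cluster.values())
        self.cluster = {c: dense(hidden, hidden) for c in clusters}
        # Specific layer: one output head per task.
        self.heads = {t: dense(hidden, n_classes) for t in self.task_to_cluster}

    def logits(self, task, x):
        h = forward(self.shared, x)                               # general
        h = forward(self.cluster[self.task_to_cluster[task]], h)  # per cluster
        w, b = self.heads[task]
        return h @ w + b                                          # per task

# Illustrative usage: "news" and "blogs" share a cluster, "reviews" does not.
model = ClusteredMTL(in_dim=8, hidden=16, n_classes=3,
                     task_to_cluster={"news": 0, "blogs": 0, "reviews": 1})
x = np.ones((2, 8))
print(model.logits("news", x).shape)  # (2, 3)
```

In this sketch the cluster layer sits between the fully shared parameters and the per-task heads, so two tasks in the same cluster differ only in their final head.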

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Text and Document Classification Technologies · Topic Modeling