Robust Task Clustering for Deep Many-Task Learning
Mo Yu, Xiaoxiao Guo, Jinfeng Yi, Shiyu Chang, Saloni Potdar, Gerald, Tesauro, Haoyu Wang, Bowen Zhou

TL;DR
This paper introduces a robust task clustering method for deep multi-task learning that uses matrix completion to accurately identify task groups, improving model training and adaptation for diverse tasks.
Contribution
It proposes a novel task clustering algorithm based on matrix completion, addressing asymmetry and unreliability in transfer performance matrices for better task partitioning.
Findings
Effective discovery of task clusters in sentiment and dialog classification.
Enhanced multi-task learning performance with flexible task grouping.
Empirical improvements in few-shot learning with multiple metrics.
Abstract
We investigate task clustering for deep-learning based multi-task and few-shot learning in a many-task setting. We propose a new method to measure task similarities with cross-task transfer performance matrix for the deep learning scenario. Although this matrix provides us critical information regarding similarity between tasks, its asymmetric property and unreliable performance scores can affect conventional clustering methods adversely. Additionally, the uncertain task-pairs, i.e., the ones with extremely asymmetric transfer scores, may collectively mislead clustering algorithms to output an inaccurate task-partition. To overcome these limitations, we propose a novel task-clustering algorithm by using the matrix completion technique. The proposed algorithm constructs a partially-observed similarity matrix based on the certainty of cluster membership of the task-pairs. We then use a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Multimodal Machine Learning Applications
