Reasonable Effectiveness of Random Weighting: A Litmus Test for   Multi-Task Learning

Baijiong Lin; Feiyang Ye; Yu Zhang; and Ivor W. Tsang

arXiv:2111.10603·cs.LG·July 28, 2022·38 cites

Reasonable Effectiveness of Random Weighting: A Litmus Test for Multi-Task Learning

Baijiong Lin, Feiyang Ye, Yu Zhang, and Ivor W. Tsang

PDF

Open Access 2 Repos

TL;DR

This paper introduces Random Weighting methods for multi-task learning, demonstrating their theoretical advantages in escaping local minima and empirically showing they perform comparably to state-of-the-art methods across various datasets.

Contribution

The paper proposes simple yet effective Random Weighting methods for MTL, providing theoretical analysis and extensive empirical validation as strong baselines.

Findings

01

RW methods can escape local minima more effectively.

02

RW methods achieve comparable performance to state-of-the-art.

03

RW methods are important baselines for future MTL research.

Abstract

Multi-Task Learning (MTL) has achieved success in various fields. However, how to balance different tasks to achieve good performance is a key problem. To achieve the task balancing, there are many works to carefully design dynamical loss/gradient weighting strategies but the basic random experiments are ignored to examine their effectiveness. In this paper, we propose the Random Weighting (RW) methods, including Random Loss Weighting (RLW) and Random Gradient Weighting (RGW), where an MTL model is trained with random loss/gradient weights sampled from a distribution. To show the effectiveness and necessity of RW methods, theoretically we analyze the convergence of RW and reveal that RW has a higher probability to escape local minima, resulting in better generalization ability. Empirically, we extensively evaluate the proposed RW methods to compare with twelve state-of-the-art methods…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Advanced Neural Network Applications