Revisiting Scalarization in Multi-Task Learning: A Theoretical   Perspective

Yuzheng Hu; Ruicheng Xian; Qilong Wu; Qiuling Fan; Lang Yin; Han Zhao

arXiv:2308.13985·cs.LG·September 25, 2023·5 cites

Revisiting Scalarization in Multi-Task Learning: A Theoretical Perspective

Yuzheng Hu, Ruicheng Xian, Qilong Wu, Qiuling Fan, Lang Yin, Han Zhao

PDF

Open Access 1 Video

TL;DR

This paper provides a theoretical analysis showing that linear scalarization in multi-task learning cannot fully explore the Pareto front, especially for balanced solutions, and demonstrates the potential advantages of specialized multi-task optimizers through experiments.

Contribution

It offers a theoretical perspective revealing the limitations of scalarization in exploring the Pareto front and compares it with SMTOs, supported by empirical evidence.

Findings

01

Scalarization cannot fully explore the Pareto front in under-parametrized models.

02

Necessary and sufficient conditions for full exploration are identified.

03

SMTOs can find balanced solutions that scalarization cannot achieve.

Abstract

Linear scalarization, i.e., combining all loss functions by a weighted sum, has been the default choice in the literature of multi-task learning (MTL) since its inception. In recent years, there is a surge of interest in developing Specialized Multi-Task Optimizers (SMTOs) that treat MTL as a multi-objective optimization problem. However, it remains open whether there is a fundamental advantage of SMTOs over scalarization. In fact, heated debates exist in the community comparing these two types of algorithms, mostly from an empirical perspective. To approach the above question, in this paper, we revisit scalarization from a theoretical perspective. We focus on linear MTL models and study whether scalarization is capable of fully exploring the Pareto front. Our findings reveal that, in contrast to recent works that claimed empirical advantages of scalarization, scalarization is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Revisiting Scalarization in Multi-Task Learning: A Theoretical Perspective· slideslive

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Advanced Multi-Objective Optimization Algorithms

MethodsFocus