In Defense of the Unitary Scalarization for Deep Multi-Task Learning

Vitaly Kurin; Alessandro De Palma; Ilya Kostrikov; Shimon Whiteson; M.; Pawan Kumar

arXiv:2201.04122·cs.LG·March 10, 2023·20 cites

In Defense of the Unitary Scalarization for Deep Multi-Task Learning

Vitaly Kurin, Alessandro De Palma, Ilya Kostrikov, Shimon Whiteson, M., Pawan Kumar

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper demonstrates that simple unitary scalarization combined with standard regularization techniques can match or outperform complex multi-task optimization algorithms, challenging recent skepticism about its effectiveness.

Contribution

The study shows that basic scalarization with regularization rivals complex multi-task optimizers, prompting a reevaluation of current multi-task learning strategies.

Findings

01

Unitary scalarization performs as well or better than complex optimizers.

02

Regularization techniques are key to effective multi-task learning.

03

Many advanced optimizers can be viewed as forms of regularization.

Abstract

Recent multi-task learning research argues against unitary scalarization, where training simply minimizes the sum of the task losses. Several ad-hoc multi-task optimization algorithms have instead been proposed, inspired by various hypotheses about what makes multi-task settings difficult. The majority of these optimizers require per-task gradients, and introduce significant memory, runtime, and implementation overhead. We show that unitary scalarization, coupled with standard regularization and stabilization techniques from single-task learning, matches or improves upon the performance of complex multi-task optimizers in popular supervised and reinforcement learning settings. We then present an analysis suggesting that many specialized multi-task optimizers can be partly interpreted as forms of regularization, potentially explaining our surprising results. We believe our results call…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yobibyte/unitary-scalarization-dmtl
pytorchOfficial

Videos

In Defense of the Unitary Scalarization for Deep Multi-Task Learning· slideslive

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research