Independent Component Alignment for Multi-Task Learning

Dmitry Senushkin; Nikolay Patakin; Arseny Kuznetsov; Anton Konushin

arXiv:2305.19000·cs.CV·May 31, 2023·1 cites

Independent Component Alignment for Multi-Task Learning

Dmitry Senushkin, Nikolay Patakin, Arseny Kuznetsov, Anton Konushin

PDF

Open Access 1 Repo

TL;DR

This paper introduces Aligned-MTL, a novel multi-task learning optimization method that uses a condition number criterion to improve stability and convergence, allowing explicit control over task trade-offs.

Contribution

The paper proposes a new stability criterion based on the condition number of gradient systems and an optimization approach that aligns gradient components to enhance multi-task learning.

Findings

01

Consistently improves performance across various MTL benchmarks.

02

Ensures convergence to a pre-defined task trade-off point.

03

Provides a theoretically grounded method for stable and controllable MTL optimization.

Abstract

In a multi-task learning (MTL) setting, a single model is trained to tackle a diverse set of tasks jointly. Despite rapid progress in the field, MTL remains challenging due to optimization issues such as conflicting and dominating gradients. In this work, we propose using a condition number of a linear system of gradients as a stability criterion of an MTL optimization. We theoretically demonstrate that a condition number reflects the aforementioned optimization issues. Accordingly, we present Aligned-MTL, a novel MTL optimization approach based on the proposed criterion, that eliminates instability in the training process by aligning the orthogonal components of the linear system of gradients. While many recent MTL approaches guarantee convergence to a minimum, task trade-offs cannot be specified in advance. In contrast, Aligned-MTL provably converges to an optimal point with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

torchjd/torchjd
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Sparse and Compressive Sensing Techniques