Leveraging convergence behavior to balance conflicting tasks in   multi-task learning

Angelica Tiemi Mizuno Nakamura; Denis Fernando Wolf; Valdir Grassi Jr

arXiv:2204.06698·cs.LG·April 15, 2022·1 cites

Leveraging convergence behavior to balance conflicting tasks in multi-task learning

Angelica Tiemi Mizuno Nakamura, Denis Fernando Wolf, Valdir Grassi Jr

PDF

Open Access 2 Repos

TL;DR

This paper introduces a dynamic multi-objective optimization method for multi-task learning that adjusts task importance based on gradient behavior, improving performance on conflicting tasks.

Contribution

It proposes a novel approach leveraging gradient temporal behavior to balance conflicting tasks in multi-task learning, outperforming existing methods.

Findings

01

Outperforms state-of-the-art methods on conflicting tasks

02

Ensures all tasks reach good generalization performance

03

Adapts task importance dynamically during training

Abstract

Multi-Task Learning is a learning paradigm that uses correlated tasks to improve performance generalization. A common way to learn multiple tasks is through the hard parameter sharing approach, in which a single architecture is used to share the same subset of parameters, creating an inductive bias between them during the training process. Due to its simplicity, potential to improve generalization, and reduce computational cost, it has gained the attention of the scientific and industrial communities. However, tasks often conflict with each other, which makes it challenging to define how the gradients of multiple tasks should be combined to allow simultaneous learning. To address this problem, we use the idea of multi-objective optimization to propose a method that takes into account temporal behaviour of the gradients to create a dynamic bias that adjust the importance of each task…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and ELM · Metaheuristic Optimization Algorithms Research