Leveraging convergence behavior to balance conflicting tasks in multi-task learning
Angelica Tiemi Mizuno Nakamura, Denis Fernando Wolf, Valdir Grassi Jr

TL;DR
This paper introduces a dynamic multi-objective optimization method for multi-task learning that adjusts task importance based on gradient behavior, improving performance on conflicting tasks.
Contribution
It proposes a novel approach leveraging gradient temporal behavior to balance conflicting tasks in multi-task learning, outperforming existing methods.
Findings
Outperforms state-of-the-art methods on conflicting tasks
Ensures all tasks reach good generalization performance
Adapts task importance dynamically during training
Abstract
Multi-Task Learning is a learning paradigm that uses correlated tasks to improve performance generalization. A common way to learn multiple tasks is through the hard parameter sharing approach, in which a single architecture is used to share the same subset of parameters, creating an inductive bias between them during the training process. Due to its simplicity, potential to improve generalization, and reduce computational cost, it has gained the attention of the scientific and industrial communities. However, tasks often conflict with each other, which makes it challenging to define how the gradients of multiple tasks should be combined to allow simultaneous learning. To address this problem, we use the idea of multi-objective optimization to propose a method that takes into account temporal behaviour of the gradients to create a dynamic bias that adjust the importance of each task…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and ELM · Metaheuristic Optimization Algorithms Research
