TL;DR
SAMO is a lightweight optimization method that integrates sharpness-aware minimization with joint global-local perturbations to effectively mitigate task conflicts in multi-task learning, improving performance and efficiency.
Contribution
Introduces SAMO, a novel lightweight sharpness-aware optimization approach that combines global and local perturbations for better multi-task learning performance.
Findings
SAMO reduces task conflicts in multi-task learning.
SAMO improves efficiency with layerwise normalized local perturbations.
Extensive experiments show SAMO's effectiveness across benchmarks.
Abstract
Multi-task learning (MTL) enables a joint model to capture commonalities across multiple tasks, reducing computation costs and improving data efficiency. However, a major challenge in MTL optimization is task conflicts, where the task gradients differ in direction or magnitude, limiting model performance compared to single-task counterparts. Sharpness-aware minimization (SAM) minimizes task loss while simultaneously reducing the sharpness of the loss landscape. Our empirical observations show that SAM effectively mitigates task conflicts in MTL. Motivated by these findings, we explore integrating SAM into MTL but face two key challenges. While both the average loss gradient and individual task gradients-referred to as global and local information-contribute to SAM, how to combine them remains unclear. Moreover, directly computing each task gradient introduces significant computational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSegment Anything Model · Sharpness-Aware Minimization
