No More Tuning: Prioritized Multi-Task Learning with Lagrangian Differential Multiplier Methods
Zhengxing Cheng, Yuheng Huang, Zhixuan Zhang, Dan Ou, Qingwen Liu

TL;DR
This paper introduces a Lagrangian-based multi-task learning framework that automatically prioritizes high-importance tasks without manual hyper-parameter tuning, improving performance and reliability in practical systems.
Contribution
It proposes a novel Lagrangian Differential Multiplier Method for multi-task optimization that eliminates the need for task-specific hyper-parameter tuning and ensures theoretical optimization guarantees.
Findings
Significant performance improvements on multiple public datasets.
Enhanced results in large-scale industrial search ranking systems.
Automatic task prioritization without manual hyper-parameter adjustment.
Abstract
Given the ubiquity of multi-task in practical systems, Multi-Task Learning (MTL) has found widespread application across diverse domains. In real-world scenarios, these tasks often have different priorities. For instance, In web search, relevance is often prioritized over other metrics, such as click-through rates or user engagement. Existing frameworks pay insufficient attention to the prioritization among different tasks, which typically adjust task-specific loss function weights to differentiate task priorities. However, this approach encounters challenges as the number of tasks grows, leading to exponential increases in hyper-parameter tuning complexity. Furthermore, the simultaneous optimization of multiple objectives can negatively impact the performance of high-priority tasks due to interference from lower-priority tasks. In this paper, we introduce a novel multi-task learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and ELM · Machine Learning and Algorithms
MethodsSoftmax · Attention Is All You Need
