TL;DR
MetaBalance is a novel method that adaptively balances the influence of auxiliary tasks in multi-task recommendation models by adjusting gradient magnitudes, leading to improved recommendation accuracy.
Contribution
It introduces a gradient manipulation technique to dynamically balance auxiliary and target tasks in multi-task learning for recommendations.
Findings
Achieves 8.34% improvement in NDCG@10 over baseline.
Effectively balances auxiliary task influence during training.
Demonstrates robustness across real-world datasets.
Abstract
In many personalized recommendation scenarios, the generalization ability of a target task can be improved via learning with additional auxiliary tasks alongside this target task on a multi-task network. However, this method often suffers from a serious optimization imbalance problem. On the one hand, one or more auxiliary tasks might have a larger influence than the target task and even dominate the network weights, resulting in worse recommendation accuracy for the target task. On the other hand, the influence of one or more auxiliary tasks might be too weak to assist the target task. More challenging is that this imbalance dynamically changes throughout the training process and varies across the parts of the same network. We propose a new method: MetaBalance to balance auxiliary losses via directly manipulating their gradients w.r.t the shared parameters in the multi-task network.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
