MetaBalance: Improving Multi-Task Recommendations via Adapting Gradient   Magnitudes of Auxiliary Tasks

Yun He; Xue Feng; Cheng Cheng; Geng Ji; Yunsong Guo; James Caverlee

arXiv:2203.06801·cs.LG·March 15, 2022

MetaBalance: Improving Multi-Task Recommendations via Adapting Gradient Magnitudes of Auxiliary Tasks

Yun He, Xue Feng, Cheng Cheng, Geng Ji, Yunsong Guo, James Caverlee

PDF

1 Repo

TL;DR

MetaBalance is a novel method that adaptively balances the influence of auxiliary tasks in multi-task recommendation models by adjusting gradient magnitudes, leading to improved recommendation accuracy.

Contribution

It introduces a gradient manipulation technique to dynamically balance auxiliary and target tasks in multi-task learning for recommendations.

Findings

01

Achieves 8.34% improvement in NDCG@10 over baseline.

02

Effectively balances auxiliary task influence during training.

03

Demonstrates robustness across real-world datasets.

Abstract

In many personalized recommendation scenarios, the generalization ability of a target task can be improved via learning with additional auxiliary tasks alongside this target task on a multi-task network. However, this method often suffers from a serious optimization imbalance problem. On the one hand, one or more auxiliary tasks might have a larger influence than the target task and even dominate the network weights, resulting in worse recommendation accuracy for the target task. On the other hand, the influence of one or more auxiliary tasks might be too weak to assist the target task. More challenging is that this imbalance dynamically changes throughout the training process and varies across the parts of the same network. We propose a new method: MetaBalance to balance auxiliary losses via directly manipulating their gradients w.r.t the shared parameters in the multi-task network.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

facebookresearch/metabalance
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.