Improving Training Stability for Multitask Ranking Models in Recommender Systems

Jiaxi Tang; Yoel Drori; Daryl Chang; Maheswaran Sathiamoorthy; Justin Gilmer; Li Wei; Xinyang Yi; Lichan Hong; Ed H. Chi

arXiv:2302.09178·cs.LG·June 16, 2023·1 cites

TL;DR

This paper addresses the challenge of training instability in large, complex multitask ranking models for recommender systems, proposing a new algorithm that enhances stability without sacrificing convergence.

Contribution

The paper identifies properties of the model that lead to unstable training, examines why existing solutions fail, and introduces a new algorithm that improves training stability in real-world recommender models.

Findings

01 The proposed algorithm significantly improves training stability.

02 It maintains model convergence while enhancing stability.

03 Experiments on YouTube data validate its effectiveness.

Abstract

Recommender systems play an important role in many content platforms. While most recommendation research is dedicated to designing better models to improve user experience, we found that research on stabilizing the training for such models is severely under-explored. As recommendation models become larger and more sophisticated, they are more susceptible to training instability issues, i.e., loss divergence, which can make the model unusable, waste significant resources and block model developments. In this paper, we share our findings and best practices we learned for improving the training stability of a real-world multitask ranking model for YouTube recommendations. We show some properties of the model that lead to unstable training and conjecture on the causes. Furthermore, based on our observations of training dynamics near the point of training instability, we hypothesize why…

Taxonomy

Topics: Advanced Bandit Algorithms Research · Recommender Systems and Techniques · Stochastic Gradient Optimization Techniques