On the Role of Weight Decay in Collaborative Filtering: A Popularity Perspective
Donald Loveland, Mingxuan Ju, Tong Zhao, Neil Shah, Danai Koutra

TL;DR
This paper reveals that weight decay in collaborative filtering encodes item popularity into embeddings, and introduces PRISM, a method that pre-encodes popularity to improve training efficiency and performance.
Contribution
The work uncovers the role of weight decay in encoding popularity and proposes PRISM, a new initialization strategy that eliminates the need for weight decay in CF models.
Findings
PRISM improves recommendation accuracy by up to 4.77%.
PRISM reduces training time by 38.48%.
Tuning weight decay influences preference towards popular or unpopular items.
Abstract
Collaborative filtering (CF) enables large-scale recommendation systems by encoding information from historical user-item interactions into dense ID-embedding tables. However, as embedding tables grow, closed-form solutions become impractical, often necessitating the use of mini-batch gradient descent for training. Despite extensive work on designing loss functions to train CF models, we argue that one core component of these pipelines is heavily overlooked: weight decay. Attaining high-performing models typically requires careful tuning of weight decay, regardless of loss, yet its necessity is not well understood. In this work, we question why weight decay is crucial in CF pipelines and how it impacts training. Through theoretical and empirical analysis, we surprisingly uncover that weight decay's primary function is to encode popularity information into the magnitudes of the embedding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Graph Neural Networks · Mobile Crowdsensing and Crowdsourcing
MethodsWeight Decay
