DSL: Understanding and Improving Softmax Recommender Systems with Competition-Aware Scaling
Bucher Sahyouni, Matthew Vowels, Liqun Chen, Simon Hadfield

TL;DR
This paper introduces Dual-scale Softmax Loss (DSL), a novel loss function for recommender systems that adapts competition sharpness based on sampled negatives, leading to significant improvements in accuracy, robustness, and fairness.
Contribution
DSL is a new loss function that infers effective sharpness from competition, reshaping negative sampling and temperature to enhance recommender system training.
Findings
DSL improves accuracy by over 10% in several benchmarks.
DSL enhances robustness under popularity shifts, averaging 9.31% gains.
Theoretical analysis shows DSL reshapes robust payoff and KL deviation.
Abstract
Softmax Loss (SL) is being increasingly adopted for recommender systems (RS) as it has demonstrated better performance, robustness and fairness. Yet in implicit-feedback, a single global temperature and equal treatment of uniformly sampled negatives can lead to brittle training, because sampled sets may contain varying degrees of relevant or informative competitors. The optimal loss sharpness for a user-item pair with a particular set of negatives, can be suboptimal or destabilising for another with different negatives. We introduce Dual-scale Softmax Loss (DSL), which infers effective sharpness from the sampled competition itself. DSL adds two complementary branches to the log-sum-exp backbone. Firstly it reweights negatives within each training instance using hardness and item--item similarity, secondly it adapts a per-example temperature from the competition intensity over a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Explainable Artificial Intelligence (XAI) · Mobile Crowdsensing and Crowdsourcing
