Divide and Conquer: Towards Better Embedding-based Retrieval for Recommender Systems From a Multi-task Perspective
Yuan Zhang, Xue Dong, Weijie Ding, Biao Li, Peng Jiang, Kun Gai

TL;DR
This paper presents a divide-and-conquer multi-task approach to improve embedding-based retrieval in recommender systems, addressing issues of relevance discrimination, diversity, and fairness in large-scale industrial applications.
Contribution
It introduces a scalable, multi-task learning framework with clustering and task adaptation techniques to enhance retrieval accuracy and control in large-scale recommender systems.
Findings
Improved discrimination between relevant and hard negatives.
Enhanced diversity and fairness in retrieval results.
Effective multi-task learning with minimal overhead.
Abstract
Embedding-based retrieval (EBR) methods are widely used in modern recommender systems thanks to its simplicity and effectiveness. However, along the journey of deploying and iterating on EBR in production, we still identify some fundamental issues in existing methods. First, when dealing with large corpus of candidate items, EBR models often have difficulties in balancing the performance on distinguishing highly relevant items (positives) from both irrelevant ones (easy negatives) and from somewhat related yet not competitive ones (hard negatives). Also, we have little control in the diversity and fairness of the retrieval results because of the ``greedy'' nature of nearest vector search. These issues compromise the performance of EBR methods in large-scale industrial scenarios. This paper introduces a simple and proven-in-production solution to overcome these issues. The proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
