Optimizing Novelty of Top-k Recommendations using Large Language Models and Reinforcement Learning
Amit Sharma, Hua Li, Xue Li, Jian Jiao

TL;DR
This paper introduces a reinforcement learning approach leveraging large language models to optimize the novelty of top-k recommendations, effectively balancing novelty and accuracy in large-scale recommendation systems.
Contribution
The paper proposes an RL formulation that reduces the sample complexity of optimizing recommendation novelty, using large language models to supply feedback for novel items.
Findings
Significant increase in recommendation novelty with minimal recall loss.
Effective application on large-scale search engine and product datasets.
Reduced sample complexity through item-wise rewards and state space reformulation.
Abstract
Given an input query, a recommendation model is trained using user feedback data (e.g., click data) to output a ranked list of items. In real-world systems, besides accuracy, an important consideration for a new model is novelty of its top-k recommendations w.r.t. an existing deployed model. However, novelty of top-k items is a difficult goal to optimize a model for, since it involves a non-differentiable sorting operation on the model's predictions. Moreover, novel items, by definition, do not have any user feedback data. Given the semantic capabilities of large language models, we address these problems using a reinforcement learning (RL) formulation where large language models provide feedback for the novel items. However, given millions of candidate items, the sample complexity of a standard RL algorithm can be prohibitively high. To reduce sample complexity, we reduce the top-k…
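To make the quantity in the abstract concrete, the sketch below computes a novelty@k score for a new model relative to a deployed base model. The definition used here (the fraction of the new model's top-k items that are absent from the base model's top-k) is an assumption for illustration and may differ from the paper's exact metric. The names `top_k` and `novelty_at_k` and the toy scores are hypothetical. The sort inside `top_k` is the discrete step that makes this quantity hard to optimize directly with gradients.

```python
import numpy as np


def top_k(scores, k):
    """Return the indices of the k highest-scoring items, best first."""
    # Sorting is a discrete operation: small changes in scores either leave
    # the ranking unchanged or swap items, so no useful gradient flows through.
    return np.argsort(-scores, kind="stable")[:k]


def novelty_at_k(new_scores, base_scores, k):
    """Fraction of the new model's top-k items not in the base model's top-k.

    Assumed definition for illustration only.
    """
    new_top = top_k(new_scores, k).tolist()
    base_top = set(top_k(base_scores, k).tolist())
    novel_items = [item for item in new_top if item not in base_top]
    return len(novel_items) / k


# Toy scores over six candidate items for a single query (made-up values).
base_scores = np.array([0.9, 0.8, 0.7, 0.1, 0.2, 0.3])
new_scores = np.array([0.9, 0.2, 0.1, 0.8, 0.7, 0.3])

print(novelty_at_k(new_scores, base_scores, k=3))
```

In this toy case the base model's top-3 is items 0, 1, 2 and the new model's top-3 is items 0, 3, 4, so two of the three recommended items are novel.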
Taxonomy
Topics: Recommender Systems and Techniques · Expert finding and Q&A systems
Methods: Sparse Evolutionary Training
