Optimizing Novelty of Top-k Recommendations using Large Language Models   and Reinforcement Learning

Amit Sharma; Hua Li; Xue Li; Jian Jiao

arXiv:2406.14169·cs.IR·June 21, 2024

Optimizing Novelty of Top-k Recommendations using Large Language Models and Reinforcement Learning

Amit Sharma, Hua Li, Xue Li, Jian Jiao

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning approach leveraging large language models to optimize the novelty of top-k recommendations, effectively balancing novelty and accuracy in large-scale recommendation systems.

Contribution

It proposes a novel RL formulation that reduces sample complexity for optimizing recommendation novelty using large language models.

Findings

01

Significant increase in recommendation novelty with minimal recall loss.

02

Effective application on large-scale search engine and product datasets.

03

Reduced sample complexity through item-wise rewards and state space reformulation.

Abstract

Given an input query, a recommendation model is trained using user feedback data (e.g., click data) to output a ranked list of items. In real-world systems, besides accuracy, an important consideration for a new model is novelty of its top-k recommendations w.r.t. an existing deployed model. However, novelty of top-k items is a difficult goal to optimize a model for, since it involves a non-differentiable sorting operation on the model's predictions. Moreover, novel items, by definition, do not have any user feedback data. Given the semantic capabilities of large language models, we address these problems using a reinforcement learning (RL) formulation where large language models provide feedback for the novel items. However, given millions of candidate items, the sample complexity of a standard RL algorithm can be prohibitively high. To reduce sample complexity, we reduce the top-k…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Expert finding and Q&A systems

MethodsSparse Evolutionary Training