Factual and Personalized Recommendations using Language Models and   Reinforcement Learning

Jihwan Jeong; Yinlam Chow; Guy Tennenholtz; Chih-Wei Hsu; Azamat; Tulepbergenov; Mohammad Ghavamzadeh; Craig Boutilier

arXiv:2310.06176·cs.AI·October 11, 2023

Factual and Personalized Recommendations using Language Models and Reinforcement Learning

Jihwan Jeong, Yinlam Chow, Guy Tennenholtz, Chih-Wei Hsu, Azamat, Tulepbergenov, Mohammad Ghavamzadeh, Craig Boutilier

PDF

Open Access

TL;DR

This paper introduces P4LM, a language model that combines factual grounding and reinforcement learning to provide personalized, precise, and explainable recommendations in conversational systems, demonstrated on movie recommendations.

Contribution

The work presents a novel language model architecture that integrates factual grounding and reinforcement learning for personalized recommendations with explanations.

Findings

01

P4LM outperforms baseline models in personalization and relevance.

02

Reinforcement learning improves the factual accuracy of recommendations.

03

The model effectively generates engaging and tailored movie narratives.

Abstract

Recommender systems (RSs) play a central role in connecting users to content, products, and services, matching candidate items to users based on their preferences. While traditional RSs rely on implicit user feedback signals, conversational RSs interact with users in natural language. In this work, we develop a comPelling, Precise, Personalized, Preference-relevant language model (P4LM) that recommends items to users while putting emphasis on explaining item characteristics and their relevance. P4LM uses the embedding space representation of a user's preferences to generate compelling responses that are factually-grounded and relevant w.r.t. the user's preferences. Moreover, we develop a joint reward function that measures precision, appeal, and personalization, which we use as AI-based feedback in a reinforcement learning-based language model framework. Using the MovieLens 25M dataset,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Topic Modeling · Machine Learning in Healthcare