Compressive Features in Offline Reinforcement Learning for Recommender   Systems

Hung Nguyen; Minh Nguyen; Long Pham; Jennifer Adorno Nieves

arXiv:2111.08817·cs.AI·November 18, 2021

Compressive Features in Offline Reinforcement Learning for Recommender Systems

Hung Nguyen, Minh Nguyen, Long Pham, Jennifer Adorno Nieves

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning-based recommender system for a game, utilizing sparse PCA and user clustering to improve reward outcomes and training efficiency on offline data, achieving state-of-the-art results.

Contribution

The paper presents a novel offline RL approach with feature extraction and user clustering, significantly enhancing reward performance and training speed in recommender systems.

Findings

01

Outperforms state-of-the-art methods in total rewards

02

Achieves approximately 20% better scores

03

Trains 30 times faster than existing methods

Abstract

In this paper, we develop a recommender system for a game that suggests potential items to players based on their interactive behaviors to maximize revenue for the game provider. Our approach is built on a reinforcement learning-based technique and is trained on an offline data set that is publicly available on an IEEE Big Data Cup challenge. The limitation of the offline data set and the curse of high dimensionality pose significant obstacles to solving this problem. Our proposed method focuses on improving the total rewards and performance by tackling these main difficulties. More specifically, we utilized sparse PCA to extract important features of user behaviors. Our Q-learning-based system is then trained from the processed offline data set. To exploit all possible information from the provided data set, we cluster user features to different groups and build an independent Q-table…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Recommender Systems and Techniques · Reinforcement Learning in Robotics

MethodsPrincipal Components Analysis