Compressive Features in Offline Reinforcement Learning for Recommender Systems
Hung Nguyen, Minh Nguyen, Long Pham, Jennifer Adorno Nieves

TL;DR
This paper introduces a reinforcement learning-based recommender system for a game, utilizing sparse PCA and user clustering to improve reward outcomes and training efficiency on offline data, achieving state-of-the-art results.
Contribution
The paper presents a novel offline RL approach with feature extraction and user clustering, significantly enhancing reward performance and training speed in recommender systems.
Findings
Outperforms state-of-the-art methods in total rewards
Achieves approximately 20% better scores
Trains 30 times faster than existing methods
Abstract
In this paper, we develop a recommender system for a game that suggests potential items to players based on their interactive behaviors to maximize revenue for the game provider. Our approach is built on a reinforcement learning-based technique and is trained on an offline data set that is publicly available on an IEEE Big Data Cup challenge. The limitation of the offline data set and the curse of high dimensionality pose significant obstacles to solving this problem. Our proposed method focuses on improving the total rewards and performance by tackling these main difficulties. More specifically, we utilized sparse PCA to extract important features of user behaviors. Our Q-learning-based system is then trained from the processed offline data set. To exploit all possible information from the provided data set, we cluster user features to different groups and build an independent Q-table…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Recommender Systems and Techniques · Reinforcement Learning in Robotics
MethodsPrincipal Components Analysis
