Interactive Search Based on Deep Reinforcement Learning

Yang Yu; Zhenhao Gu; Rong Tao; Jingtian Ge; Kenglun Chang

arXiv:2012.06052·cs.LG·December 14, 2020·1 cites

Interactive Search Based on Deep Reinforcement Learning

Yang Yu, Zhenhao Gu, Rong Tao, Jingtian Ge, Kenglun Chang

PDF

Open Access

TL;DR

This paper explores an interactive recommendation system using deep reinforcement learning, emphasizing offline training with a virtual user environment and an improved algorithm to enhance recommendation quality.

Contribution

It introduces a virtual user environment for offline training and proposes a bi-clustering based reinforcement learning algorithm to expand action and recommendation spaces.

Findings

01

Effective virtual environment for offline training created

02

Bi-clustering algorithm improves recommendation diversity

03

Enhanced long-term user engagement predicted

Abstract

With the continuous development of machine learning technology, major e-commerce platforms have launched recommendation systems based on it to serve a large number of customers with different needs more efficiently. Compared with traditional supervised learning, reinforcement learning can better capture the user's state transition in the decision-making process, and consider a series of user actions, not just the static characteristics of the user at a certain moment. In theory, it will have a long-term perspective, producing a more effective recommendation. The special requirements of reinforcement learning for data make it need to rely on an offline virtual system for training. Our project mainly establishes a virtual user environment for offline training. At the same time, we tried to improve a reinforcement learning algorithm based on bi-clustering to expand the action space and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Data Stream Mining Techniques