Interactive Search Based on Deep Reinforcement Learning
Yang Yu, Zhenhao Gu, Rong Tao, Jingtian Ge, Kenglun Chang

TL;DR
This paper explores an interactive recommendation system using deep reinforcement learning, emphasizing offline training with a virtual user environment and an improved algorithm to enhance recommendation quality.
Contribution
It introduces a virtual user environment for offline training and proposes a bi-clustering based reinforcement learning algorithm to expand action and recommendation spaces.
Findings
Effective virtual environment for offline training created
Bi-clustering algorithm improves recommendation diversity
Enhanced long-term user engagement predicted
Abstract
With the continuous development of machine learning technology, major e-commerce platforms have launched recommendation systems based on it to serve a large number of customers with different needs more efficiently. Compared with traditional supervised learning, reinforcement learning can better capture the user's state transition in the decision-making process, and consider a series of user actions, not just the static characteristics of the user at a certain moment. In theory, it will have a long-term perspective, producing a more effective recommendation. The special requirements of reinforcement learning for data make it need to rely on an offline virtual system for training. Our project mainly establishes a virtual user environment for offline training. At the same time, we tried to improve a reinforcement learning algorithm based on bi-clustering to expand the action space and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Data Stream Mining Techniques
