Deep Hierarchical Reinforcement Learning Based Recommendations via Multi-goals Abstraction
Dongyang Zhao, Liang Zhang, Bo Zhang, Lizhou Zheng, Yongjun Bao,, Weipeng Yan

TL;DR
This paper introduces a deep hierarchical reinforcement learning framework with multi-goals abstraction for recommendation systems, improving conversion metrics by effectively capturing long-term sparse signals through a novel two-level agent structure.
Contribution
The paper proposes a new deep hierarchical reinforcement learning algorithm, HRL-MG, with multi-goals abstraction, shared encoders, and a benefit assignment function to enhance recommendation performance.
Findings
Improved conversion rates on real-world e-commerce data.
Faster convergence due to shared state encoders.
Effective handling of sparse long-term feedback signals.
Abstract
The recommender system is an important form of intelligent application, which assists users to alleviate from information redundancy. Among the metrics used to evaluate a recommender system, the metric of conversion has become more and more important. The majority of existing recommender systems perform poorly on the metric of conversion due to its extremely sparse feedback signal. To tackle this challenge, we propose a deep hierarchical reinforcement learning based recommendation framework, which consists of two components, i.e., high-level agent and low-level agent. The high-level agent catches long-term sparse conversion signals, and automatically sets abstract goals for low-level agent, while the low-level agent follows the abstract goals and interacts with real-time environment. To solve the inherent problem in hierarchical reinforcement learning, we propose a novel deep…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Advanced Graph Neural Networks
