Deep Hierarchical Reinforcement Learning Based Recommendations via   Multi-goals Abstraction

Dongyang Zhao; Liang Zhang; Bo Zhang; Lizhou Zheng; Yongjun Bao,; Weipeng Yan

arXiv:1903.09374·cs.LG·March 25, 2019·1 cites

Deep Hierarchical Reinforcement Learning Based Recommendations via Multi-goals Abstraction

Dongyang Zhao, Liang Zhang, Bo Zhang, Lizhou Zheng, Yongjun Bao,, Weipeng Yan

PDF

Open Access

TL;DR

This paper introduces a deep hierarchical reinforcement learning framework with multi-goals abstraction for recommendation systems, improving conversion metrics by effectively capturing long-term sparse signals through a novel two-level agent structure.

Contribution

The paper proposes a new deep hierarchical reinforcement learning algorithm, HRL-MG, with multi-goals abstraction, shared encoders, and a benefit assignment function to enhance recommendation performance.

Findings

01

Improved conversion rates on real-world e-commerce data.

02

Faster convergence due to shared state encoders.

03

Effective handling of sparse long-term feedback signals.

Abstract

The recommender system is an important form of intelligent application, which assists users to alleviate from information redundancy. Among the metrics used to evaluate a recommender system, the metric of conversion has become more and more important. The majority of existing recommender systems perform poorly on the metric of conversion due to its extremely sparse feedback signal. To tackle this challenge, we propose a deep hierarchical reinforcement learning based recommendation framework, which consists of two components, i.e., high-level agent and low-level agent. The high-level agent catches long-term sparse conversion signals, and automatically sets abstract goals for low-level agent, while the low-level agent follows the abstract goals and interacts with real-time environment. To solve the inherent problem in hierarchical reinforcement learning, we propose a novel deep…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Advanced Graph Neural Networks