JDRec: Practical Actor-Critic Framework for Online Combinatorial   Recommender System

Xin Zhao (1); Zhiwei Fang (1); Yuchen Guo (2); Jie He (1); Wenlong; Chen (1); Changping Peng (1) ((1) JD.com; (2) Tsinghua University)

arXiv:2207.13311·cs.IR·July 28, 2022

JDRec: Practical Actor-Critic Framework for Online Combinatorial Recommender System

Xin Zhao (1), Zhiwei Fang (1), Yuchen Guo (2), Jie He (1), Wenlong, Chen (1), Changping Peng (1) ((1) JD.com, (2) Tsinghua University)

PDF

Open Access

TL;DR

JDRec introduces a practical actor-critic framework for online combinatorial recommendation, optimizing list generation and evaluation to improve user engagement and platform value.

Contribution

The paper presents a novel actor-critic reinforcement learning framework tailored for online combinatorial recommender systems, addressing efficiency, personalization, and dynamic adaptation.

Findings

01

Improved click-through rate by 2.6% in online JD recommendation

02

Enhanced platform value by 5.03% through JDRec

03

Demonstrated effectiveness in offline and online experiments

Abstract

A combinatorial recommender (CR) system feeds a list of items to a user at a time in the result page, in which the user behavior is affected by both contextual information and items. The CR is formulated as a combinatorial optimization problem with the objective of maximizing the recommendation reward of the whole list. Despite its importance, it is still a challenge to build a practical CR system, due to the efficiency, dynamics, personalization requirement in online environment. In particular, we tear the problem into two sub-problems, list generation and list evaluation. Novel and practical model architectures are designed for these sub-problems aiming at jointly optimizing effectiveness and efficiency. In order to adapt to online case, a bootstrap algorithm forming an actor-critic reinforcement framework is given to explore better recommendation mode in long-term user interaction.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Artificial Intelligence in Games · Advanced Bandit Algorithms Research