CTR is not Enough: a Novel Reinforcement Learning based Ranking Approach   for Optimizing Session Clicks

Shaowei Liu; Yangjun Liu

arXiv:2308.14056·cs.IR·August 29, 2023

CTR is not Enough: a Novel Reinforcement Learning based Ranking Approach for Optimizing Session Clicks

Shaowei Liu, Yangjun Liu

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning-based ranking method that optimizes long-term user engagement by considering user leaving behavior, outperforming traditional CTR-focused approaches in recommendation systems.

Contribution

It proposes a novel long-term optimization goal called CTE and develops a reinforcement learning model to improve session-level recommendation performance.

Findings

01

Significant improvement in session click counts in offline tests

02

Enhanced online performance in TaoBao e-commerce platform

03

Effective modeling of user leaving behavior

Abstract

Ranking is a crucial module using in the recommender system. In particular, the ranking module using in our YoungTao recommendation scenario is to provide an ordered list of items to users, to maximize the click number throughout the recommendation session for each user. However, we found that the traditional ranking method for optimizing Click-Through rate(CTR) cannot address our ranking scenario well, since it completely ignores user leaving, and CTR is the optimization goal for the one-step recommendation. To effectively undertake the purpose of our ranking module, we propose a long-term optimization goal, named as CTE (Click-Through quantity expectation), for explicitly taking the behavior of user leaving into account. Based on CTE, we propose an effective model trained by reinforcement learning. Moreover, we build a simulation environment from offline log data for estimating PBR…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Smart Grid Energy Management