Offline Reinforcement Learning for Mobile Notifications
Yiping Yuan, Ajith Muralidharan, Preetam Nandy, Miao Cheng, Prakruthi, Prabhakar

TL;DR
This paper introduces an offline reinforcement learning framework for mobile notification systems, improving long-term user engagement by optimizing sequential decisions through offline policy evaluation and online deployment.
Contribution
It presents a novel offline RL approach with a state-marginalized importance sampling method for notification systems, enabling effective offline policy evaluation and deployment.
Findings
Offline RL outperforms traditional response-prediction models.
The proposed evaluation method accurately predicts policy performance.
Online deployment of the RL policy increased user engagement.
Abstract
Mobile notification systems have taken a major role in driving and maintaining user engagement for online platforms. They are interesting recommender systems to machine learning practitioners with more sequential and long-term feedback considerations. Most machine learning applications in notification systems are built around response-prediction models, trying to attribute both short-term impact and long-term impact to a notification decision. However, a user's experience depends on a sequence of notifications and attributing impact to a single notification is not always accurate, if not impossible. In this paper, we argue that reinforcement learning is a better framework for notification systems in terms of performance and iteration speed. We propose an offline reinforcement learning framework to optimize sequential notification decisions for driving user engagement. We describe a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPersonal Information Management and User Behavior · Green IT and Sustainability · Age of Information Optimization
