Multi-objective Optimization of Notifications Using Offline   Reinforcement Learning

Prakruthi Prabhakar; Yiping Yuan; Guangyu Yang; Wensheng Sun; Ajith; Muralidharan

arXiv:2207.03029·cs.LG·July 8, 2022

Multi-objective Optimization of Notifications Using Offline Reinforcement Learning

Prakruthi Prabhakar, Yiping Yuan, Guangyu Yang, Wensheng Sun, Ajith, Muralidharan

PDF

Open Access

TL;DR

This paper presents an offline reinforcement learning framework for optimizing multi-objective notification decisions in mobile systems, addressing challenges like distributional shift and demonstrating improved performance through experiments.

Contribution

It introduces a novel offline RL approach using Conservative Q-learning for multi-objective notification optimization in mobile systems.

Findings

01

Improved notification decision quality in offline and online tests

02

Effective mitigation of distributional shift in offline RL

03

Enhanced user engagement through optimized notifications

Abstract

Mobile notification systems play a major role in a variety of applications to communicate, send alerts and reminders to the users to inform them about news, events or messages. In this paper, we formulate the near-real-time notification decision problem as a Markov Decision Process where we optimize for multiple objectives in the rewards. We propose an end-to-end offline reinforcement learning framework to optimize sequential notification decisions. We address the challenge of offline learning using a Double Deep Q-network method based on Conservative Q-learning that mitigates the distributional shift problem and Q-value overestimation. We illustrate our fully-deployed system and demonstrate the performance and benefits of the proposed approach through both offline and online experiments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCaching and Content Delivery · Green IT and Sustainability · Opportunistic and Delay-Tolerant Networks

MethodsQ-Learning