Dynamic Learning of Sequential Choice Bandit Problem under Marketing Fatigue
Junyu Cao, Wei Sun

TL;DR
This paper introduces a dynamic learning framework for sequential marketing messages, modeling user abandonment due to fatigue, and develops algorithms to optimize message sequencing and personalization over time.
Contribution
It proposes a novel sequential choice model capturing user interactions and abandonment, along with efficient algorithms for offline optimization and online learning with regret analysis.
Findings
Efficient polynomial-time algorithm for offline optimization.
An online algorithm with proven regret bounds.
Extension of the model to include user personalization.
Abstract
Motivated by the observation that overexposure to unwanted marketing activities leads to customer dissatisfaction, we consider a setting where a platform offers a sequence of messages to its users and is penalized when users abandon the platform due to marketing fatigue. We propose a novel sequential choice model to capture multiple interactions taking place between the platform and its user: Upon receiving a message, a user decides on one of the three actions: accept the message, skip and receive the next message, or abandon the platform. Based on user feedback, the platform dynamically learns users' abandonment distribution and their valuations of messages to determine the length of the sequence and the order of the messages, while maximizing the cumulative payoff over a horizon of length T. We refer to this online learning task as the sequential choice bandit problem. For the offline…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Consumer Market Behavior and Pricing · Auction Theory and Applications
