RLTP: Reinforcement Learning to Pace for Delayed Impression Modeling in Preloaded Ads
Penghui Wei, Yongqiang Chen, Shaoguo Liu, Liang Wang, Bo Zheng

TL;DR
This paper introduces RLTP, a reinforcement learning framework designed to improve impression pacing for preloaded ads, effectively handling delayed feedback and optimizing delivery performance and impression guarantees.
Contribution
RLTP is the first reinforcement learning-based pacing algorithm tailored for preloaded ads with delayed impressions, jointly optimizing impression count and delivery quality.
Findings
RLTP outperforms baseline algorithms significantly in experiments.
Online deployment shows improvements in delivery completion and click-through rates.
RLTP effectively manages delayed feedback in impression pacing.
Abstract
To increase brand awareness, many advertisers conclude contracts with advertising platforms to purchase traffic and then deliver advertisements to target audiences. In a whole delivery period, advertisers usually desire a certain impression count for the ads, and they also expect that the delivery performance is as good as possible (e.g., obtaining high click-through rate). Advertising platforms employ pacing algorithms to satisfy the demands via adjusting the selection probabilities to traffic requests in real-time. However, the delivery procedure is also affected by the strategies from publishers, which cannot be controlled by advertising platforms. Preloading is a widely used strategy for many types of ads (e.g., video ads) to make sure that the response time for displaying after a traffic request is legitimate, which results in delayed impression phenomenon. Traditional pacing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTransportation and Mobility Innovations · Privacy, Security, and Data Protection · Green IT and Sustainability
Methodsfail
