Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation
Ze Wang, Guogang Liao, Xiaowen Shi, Xiaoxu Wu, Chuheng Zhang, Bingqi, Zhu, Yongkang Wang, Xingxing Wang, Dong Wang

TL;DR
This paper introduces SHTAA, a transfer learning method for deep reinforcement learning in ads allocation, improving performance in data-scarce scenarios and boosting platform revenue.
Contribution
It proposes a novel similarity-based hybrid transfer approach for deep RL in ads allocation, effectively transferring knowledge from data-rich to data-poor entrances.
Findings
Enhanced revenue in data-scarce entrances
Improved performance over baseline methods
Effective transfer of knowledge across entrances
Abstract
Ads allocation, which involves allocating ads and organic items to limited slots in feed with the purpose of maximizing platform revenue, has become a research hotspot. Notice that, e-commerce platforms usually have multiple entrances for different categories and some entrances have few visits. Data from these entrances has low coverage, which makes it difficult for the agent to learn. To address this challenge, we propose Similarity-based Hybrid Transfer for Ads Allocation (SHTAA), which effectively transfers samples as well as knowledge from data-rich entrance to data-poor entrance. Specifically, we define an uncertainty-aware similarity for MDP to estimate the similarity of MDP for different entrances. Based on this similarity, we design a hybrid transfer method, including instance transfer and strategy transfer, to efficiently transfer samples and knowledge from one entrance to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsConsumer Market Behavior and Pricing · Digital Marketing and Social Media · Sharing Economy and Platforms
