Genie: An Open Box Counterfactual Policy Estimator for Optimizing Sponsored Search Marketplace
Murat Ali Bayir, Mingsen Xu, Yaojia Zhu, Yifan Shi

TL;DR
Genie is an offline counterfactual policy estimation framework that uses an open box simulation engine to optimize sponsored search marketplaces, outperforming existing methods and enabling safe policy tuning without risky experiments.
Contribution
Introducing Genie, a novel open box simulation-based framework for offline policy estimation that improves optimization efficiency and safety in sponsored search marketplaces.
Findings
Genie outperforms observational approaches on Bing traffic.
It enables safe tuning of new policies without risky experiments.
It handles frequent policy updates effectively.
Abstract
In this paper, we propose an offline counterfactual policy estimation framework called Genie to optimize Sponsored Search Marketplace. Genie employs an open box simulation engine with click calibration model to compute the KPI impact of any modification to the system. From the experimental results on Bing traffic, we showed that Genie performs better than existing observational approaches that employs randomized experiments for traffic slices that have frequent policy updates. We also show that Genie can be used to tune completely new policies efficiently without creating risky randomized experiments due to cold start problem. As time of today, Genie hosts more than 10000 optimization jobs yearly which runs more than 30 Million processing node hours of big data jobs for Bing Ads. For the last 3 years, Genie has been proven to be the one of the major platforms to optimize Bing Ads…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Recommender Systems and Techniques · Advanced Bandit Algorithms Research
