Optimized Supergeo Design: A Scalable Framework for Geographic Marketing Experiments
Charles Shaw

TL;DR
This paper introduces OSD, a scalable two-stage framework for designing geographic experiments that optimizes Supergeo partitions to improve covariate balance and efficiency in large markets.
Contribution
The paper presents a novel scalable framework combining PCA and MILP for Supergeo design, addressing NP-hardness and computational challenges in geographic experiments.
Findings
OSD achieves near-optimal covariate balance and efficiency.
Scales to 1,000 markets with rapid computation.
Outperforms spectral embeddings in covariate balance.
Abstract
Geographic experiments are a widely-used methodology for measuring incremental return on ad spend (iROAS) at scale, yet their design presents significant challenges. The unit count is small, heterogeneity is large, and the optimal Supergeo partitioning problem is NP-hard. We introduce Optimized Supergeo Design (OSD), a two-stage framework that renders Supergeo designs practical for large-scale markets. Principal Component Analysis (PCA) first reduces the covariate space to create interpretable geo-embeddings. A Mixed-Integer Linear Programming (MILP) solver then selects a partition that balances both baseline outcomes and pre-treatment covariates. We provide theoretical arguments that OSD's objective value is within of the global optimum under community-structure assumptions. Rigorous ablation analysis on synthetic data shows that PCA- and random-embedding Supergeo…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsConsumer Market Behavior and Pricing
