# Auctions Meet Bandits: An Empirical Analysis

**Authors:** Mohammad Rashid, Omid Rafieian, Soheil Ghili

arXiv: 2508.21162 · 2025-09-01

## TL;DR

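Using data from a leading Asian mobile app store, this paper quantifies the gains from optimizing exploration when a Thompson Sampling algorithm runs inside a second-price sponsored-search auction. It shows that tailoring the exploration level to each keyword improves both revenue and efficiency.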

## Contribution

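The paper empirically quantifies the value of optimizing exploration in a combined auction-bandit model and shows how this setting differs from the canonical bandit problem. It proposes a customized strategy that sets each keyword's exploration level according to its characteristics, and derives the Pareto frontier for revenue and efficiency.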

## Key findings

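- Optimizing exploration in the combined auction-bandit model yields substantial revenue gains for the platform.
- Adjusting the exploration level for each keyword according to its characteristics also improves efficiency, so the tailored approach gains on both metrics.
- The auction-bandit problem differs substantially from the canonical bandit problem.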

## Abstract

Sponsored search positions are typically allocated through real-time auctions, where the outcomes depend on advertisers' quality-adjusted bids (the product of their bids and quality scores). Although quality scoring helps promote ads with higher conversion outcomes, setting these scores for new advertisers in any given market is challenging, leading to the cold-start problem. To address this, platforms incorporate multi-armed bandit algorithms in auctions to balance exploration and exploitation. However, little is known about the optimal exploration strategies in such auction environments. We utilize data from a leading Asian mobile app store that places sponsored ads for keywords. The platform employs a Thompson Sampling algorithm within a second-price auction to learn quality scores and allocate a single sponsored position for each keyword. We empirically quantify the gains from optimizing exploration under this combined auction-bandit model and show that this problem differs substantially from the canonical bandit problem. Drawing on these empirical insights, we propose a customized exploration strategy in which the platform adjusts the exploration levels for each keyword according to its characteristics. We derive the Pareto frontier for revenue and efficiency and provide actionable policies, demonstrating substantial gains for the platform on both metrics when using a tailored exploration approach.
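To make the mechanism in the abstract concrete, here is a minimal sketch of one auction round that combines Thompson Sampling with a quality-weighted second-price rule. It is not the platform's or the authors' implementation. Several pieces are assumptions introduced for illustration: the Beta-Bernoulli model with a uniform prior, pay-per-click pricing in which the winner pays the runner-up's quality-adjusted bid divided by the winner's own sampled quality, and the names `Advertiser`, `run_auction`, and `record_outcome`.

```python
import random
from dataclasses import dataclass


@dataclass
class Advertiser:
    name: str
    bid: float            # bid per click (assumed pricing unit)
    clicks: int = 0       # observed successes so far
    impressions: int = 0  # times this ad was shown

    def sample_quality(self, rng: random.Random) -> float:
        # Thompson Sampling step: draw a quality score from the
        # Beta posterior implied by a uniform Beta(1, 1) prior (assumption).
        return rng.betavariate(1 + self.clicks,
                               1 + self.impressions - self.clicks)


def run_auction(advertisers, rng):
    """Allocate the single sponsored slot for one keyword query."""
    if len(advertisers) < 2:
        raise ValueError("need at least two advertisers for a second price")
    # One posterior draw per advertiser, then rank by bid * sampled quality.
    draws = [(a, a.sample_quality(rng)) for a in advertisers]
    ranked = sorted(draws, key=lambda d: d[0].bid * d[1], reverse=True)
    (winner, q_win), (runner_up, q_next) = ranked[0], ranked[1]
    # Assumed payment rule: the smallest per-click price at which the
    # winner's quality-adjusted bid still matches the runner-up's.
    price = runner_up.bid * q_next / q_win if q_win > 0 else 0.0
    return winner, price


def record_outcome(winner, clicked):
    # Update the winner's posterior counts after the impression.
    winner.impressions += 1
    winner.clicks += int(clicked)
```

A new advertiser with no history draws from a wide posterior, so it occasionally wins the slot even with a modest bid. That occasional win is the exploration the abstract refers to. Under the assumed payment rule, the price never exceeds the winner's own bid, because the runner-up's quality-adjusted bid is no larger than the winner's.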

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2508.21162/full.md

## Figures

21 figures with captions in the complete paper: https://tomesphere.com/paper/2508.21162/full.md

## References

33 references, with the full list in the complete paper: https://tomesphere.com/paper/2508.21162/full.md

---
Source: https://tomesphere.com/paper/2508.21162