Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection
Peter Henderson, Ben Chugg, Brandon Anderson, Kristen Altenburger,, Alex Turk, John Guyton, Jacob Goldin, Daniel E. Ho

TL;DR
This paper introduces a new structured bandit setting that balances reward maximization with unbiased population estimation, demonstrated through IRS audit data to improve tax gap estimation and audit efficacy.
Contribution
It formulates the optimize-and-estimate bandit problem, proposes a novel unbiased estimation mechanism, and applies it to IRS audit selection to enhance policy outcomes.
Findings
Unbiased population estimation achieves comparable rewards to baseline methods.
The approach improves tax gap estimation accuracy.
Potential to enhance IRS audit efficiency and policy relevance.
Abstract
We introduce a new setting, optimize-and-estimate structured bandits. Here, a policy must select a batch of arms, each characterized by its own context, that would allow it to both maximize reward and maintain an accurate (ideally unbiased) population estimate of the reward. This setting is inherent to many public and private sector applications and often requires handling delayed feedback, small data, and distribution shifts. We demonstrate its importance on real data from the United States Internal Revenue Service (IRS). The IRS performs yearly audits of the tax base. Two of its most important objectives are to identify suspected misreporting and to estimate the "tax gap" -- the global difference between the amount paid and true amount owed. Based on a unique collaboration with the IRS, we cast these two processes as a unified optimize-and-estimate structured bandit. We analyze…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management · Decision-Making and Behavioral Economics
Methodstravel james
