Decision-Focused Evaluation: Analyzing Performance of Deployed Restless Multi-Arm Bandits
Paritosh Verma, Shresth Verma, Aditya Mate, Aparna Taneja, Milind, Tambe

TL;DR
This paper analyzes the real-world deployment of restless multi-arm bandit systems in public health, revealing that higher prediction accuracy does not always lead to better system performance, and introduces decision-focused evaluation metrics.
Contribution
It provides the first analysis of deployed RMAB systems in public health, highlighting the complex relationship between prediction accuracy and system performance, and proposes decision-focused evaluation metrics.
Findings
Improvement in prediction accuracy can degrade overall system performance.
Decision-focused metrics better explain system performance than traditional accuracy measures.
Deployed RMAB systems in public health reveal non-linear effects of prediction improvements.
Abstract
Restless multi-arm bandits (RMABs) is a popular decision-theoretic framework that has been used to model real-world sequential decision making problems in public health, wildlife conservation, communication systems, and beyond. Deployed RMAB systems typically operate in two stages: the first predicts the unknown parameters defining the RMAB instance, and the second employs an optimization algorithm to solve the constructed RMAB instance. In this work we provide and analyze the results from a first-of-its-kind deployment of an RMAB system in public health domain, aimed at improving maternal and child health. Our analysis is focused towards understanding the relationship between prediction accuracy and overall performance of deployed RMAB systems. This is crucial for determining the value of investing in improving predictive accuracy towards improving the final system performance, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management
