Asymptotically Optimal Regret for Black-Box Predict-then-Optimize
Samuel Tan, Peter I. Frazier

TL;DR
This paper introduces a new loss function called Empirical Soft Regret for training models in predict-then-optimize tasks, achieving asymptotically optimal regret and outperforming existing methods in real-world decision-making scenarios.
Contribution
The paper proposes a novel differentiable loss function for predict-then-optimize problems that lacks special structure and demonstrates its asymptotic optimality and practical effectiveness.
Findings
The ESR loss improves reward outcomes compared to classical metrics.
Theoretical proof of asymptotic optimality for paired data.
Empirical results show superior performance in news recommendation and healthcare tasks.
Abstract
We consider the predict-then-optimize paradigm for decision-making in which a practitioner (1) trains a supervised learning model on historical data of decisions, contexts, and rewards, and then (2) uses the resulting model to make future binary decisions for new contexts by finding the decision that maximizes the model's predicted reward. This approach is common in industry. Past analysis assumes that rewards are observed for all actions for all historical contexts, which is possible only in problems with special structure. Motivated by problems from ads targeting and recommender systems, we study new black-box predict-then-optimize problems that lack this special structure and where we only observe the reward from the action taken. We present a novel loss function, which we call Empirical Soft Regret (ESR), designed to significantly improve reward when used in training compared to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Fault Detection and Control Systems
