Asymptotically Optimal Regret for Black-Box Predict-then-Optimize

Samuel Tan; Peter I. Frazier

arXiv:2406.07866·cs.LG·June 13, 2024

Asymptotically Optimal Regret for Black-Box Predict-then-Optimize

Samuel Tan, Peter I. Frazier

PDF

Open Access

TL;DR

This paper introduces a new loss function called Empirical Soft Regret for training models in predict-then-optimize tasks, achieving asymptotically optimal regret and outperforming existing methods in real-world decision-making scenarios.

Contribution

The paper proposes a novel differentiable loss function for predict-then-optimize problems that lacks special structure and demonstrates its asymptotic optimality and practical effectiveness.

Findings

01

The ESR loss improves reward outcomes compared to classical metrics.

02

Theoretical proof of asymptotic optimality for paired data.

03

Empirical results show superior performance in news recommendation and healthcare tasks.

Abstract

We consider the predict-then-optimize paradigm for decision-making in which a practitioner (1) trains a supervised learning model on historical data of decisions, contexts, and rewards, and then (2) uses the resulting model to make future binary decisions for new contexts by finding the decision that maximizes the model's predicted reward. This approach is common in industry. Past analysis assumes that rewards are observed for all actions for all historical contexts, which is possible only in problems with special structure. Motivated by problems from ads targeting and recommender systems, we study new black-box predict-then-optimize problems that lack this special structure and where we only observe the reward from the action taken. We present a novel loss function, which we call Empirical Soft Regret (ESR), designed to significantly improve reward when used in training compared to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Fault Detection and Control Systems