Learning and Decision-Making with Data: Optimal Formulations and Phase Transitions
Amine Bennouna, Bart P.G. Van Parys

TL;DR
This paper investigates optimal data-driven decision-making formulations based on historical data, revealing phase transitions between different regimes and connecting robust, entropic, and variance penalized approaches.
Contribution
It introduces a novel framework for designing optimal data-driven formulations by measuring their out-of-sample performance and characterizes phase transitions among three regimes.
Findings
Identifies three distinct out-of-sample performance regimes.
Shows the existence of phase transitions between regimes.
Connects robust, entropic, and variance penalized formulations.
Abstract
We study the problem of designing optimal learning and decision-making formulations when only historical data is available. Prior work typically commits to a particular class of data-driven formulation and subsequently tries to establish out-of-sample performance guarantees. We take here the opposite approach. We define first a sensible yard stick with which to measure the quality of any data-driven formulation and subsequently seek to find an optimal such formulation. Informally, any data-driven formulation can be seen to balance a measure of proximity of the estimated cost to the actual cost while guaranteeing a level of out-of-sample performance. Given an acceptable level of out-of-sample performance, we construct explicitly a data-driven formulation that is uniformly closer to the true cost than any other formulation enjoying the same out-of-sample performance. We show the existence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbabilistic and Robust Engineering Design · Advanced Multi-Objective Optimization Algorithms · Gaussian Processes and Bayesian Inference
