Holistic Robust Data-Driven Decisions
Amine Bennouna, Bart Van Parys, Ryan Lucas

TL;DR
This paper introduces a holistic, computationally feasible data-driven optimization framework that simultaneously guards against statistical error, data noise, and data misspecification, enhancing out-of-sample performance.
Contribution
It proposes a novel distributionally robust optimization formulation combining Kullback-Leibler and Lévy-Prokhorov metrics, providing comprehensive protection against multiple overfitting sources.
Findings
Effective in neural network training with healthcare data under noise and errors
Improves portfolio risk/return tradeoff under distribution shifts
Unifies regularized and robust formulations as special cases
Abstract
The design of data-driven formulations for machine learning and decision-making with good out-of-sample performance is a key challenge. The observation that good in-sample performance does not guarantee good out-of-sample performance is generally known as overfitting. Practical overfitting can typically not be attributed to a single cause but is caused by several factors simultaneously. We consider here three overfitting sources: (i) statistical error as a result of working with finite sample data, (ii) data noise, which occurs when the data points are measured only with finite precision, and finally, (iii) data misspecification in which a small fraction of all data may be wholly corrupted. Although existing data-driven formulations may be robust against one of these three sources in isolation, they do not provide holistic protection against all overfitting sources simultaneously. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Risk and Portfolio Optimization · Forecasting Techniques and Applications
