Decision-Theoretic Approaches for Improved Learning-Augmented Algorithms
Spyros Angelopoulos, Christoph D\"urr, Georgii Melidi

TL;DR
This paper develops decision-theoretic metrics to evaluate and improve learning-augmented algorithms, enabling better performance trade-offs and comparisons across algorithms using deterministic and stochastic measures.
Contribution
It introduces a systematic framework using decision-theoretic metrics for analyzing algorithms with learned predictions, applicable to various online decision problems.
Findings
Framework quantifies algorithm performance across prediction errors
Balances performance and risk in algorithm design
Applied to ski-rental, one-max search, and contract scheduling
Abstract
We initiate the systematic study of decision-theoretic metrics in the design and analysis of algorithms with machine-learned predictions. We introduce approaches based on both deterministic measures such as distance-based evaluation, that help us quantify how close the algorithm is to an ideal solution, and stochastic measures that balance the trade-off between the algorithm's performance and the risk associated with the imperfect oracle. These approaches allow us to quantify the algorithm's performance across the full spectrum of the prediction error, and thus choose the best algorithm within an entire class of otherwise incomparable ones. We apply our framework to three well-known problems from online decision making, namely ski-rental, one-max search, and contract scheduling.
Peer Reviews
Decision·ICLR 2026 Poster
Pre-existing work centers on consistency/robustness trade-offs and Pareto-optimality (e.g., caching, one-way trading, search) or on tolerance windows and distributional advice. Comparison of algorithms among those Pareto optimal algorithms is a new thing. The frameworks for distance measures and CVaR-based risk are both nice, actionable, and well-posed. They benchmark on synthetic and real data and report better average ratios or profits than Pareto-optimal (PO) and $\delta$-tolerance baseline
I actually don't understand the benefit of going beyond Pareto, or why we wouldn't just let practitioners choose one algorithm from the Pareto set. Everyone has different preferences for trading off Consistency and Robustness; given the Pareto-optimal algorithms curve, a practitioner's preferences uniquely pinpoint one algorithm (as in economics class, where the optimal consumption position is where the indifference curve is tangent to the budget line).
-The paper is generally easy to read and follow; it is well motivated and easy to understand even for someone outside the field. -The choice of CVAR is one that is easy to accept, being commonly used in decision theory. - For a theory paper, it is nice to see some experiments, though they seem to be slightly contrived. Disclaimer: I am not from the area and cannot confidently comment on novelty nor quality.
- No major issues here from me. - Minor criticism : Based on my understanding of the experiments, it seems that the authors are showing that by directly optimizing their metrics, better "results" are obtained based on those very metrics. This is of course unsurprising, so claims like "distance-based algorithms offer considerable improvements over the sota" aren't that fair.
* Systematic approach helps illustrate parallels between related problems, and provides a principled way to choose the "best" algorithm from a set of options. * Risk-oriented analysis is well-motivated by the increasing application of online learning methods in high-stakes applications. * Algorithmic results are evaluated both theoretically and empirically.
* Scope of novel contributions is unclear. Apart from the value of a unified perspective, it is not completely clear how algorithmic results compare to existing upper and lower bounds known in the literature. * Limitations and practical applicability are not discussed explicitly. * Empirical evaluation in the body of the paper is limited to synthetic data. Appendix D.3 seems to provide some empirical evaluation, but analysis does not seem to be conclusive (i.e., there doesn't seem to be an algor
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsFocus
