Decision-Theoretic Approaches for Improved Learning-Augmented Algorithms

Spyros Angelopoulos; Christoph D\"urr; Georgii Melidi

arXiv:2501.17701·cs.DS·September 16, 2025

Decision-Theoretic Approaches for Improved Learning-Augmented Algorithms

Spyros Angelopoulos, Christoph D\"urr, Georgii Melidi

PDF

Open Access 3 Reviews

TL;DR

This paper develops decision-theoretic metrics to evaluate and improve learning-augmented algorithms, enabling better performance trade-offs and comparisons across algorithms using deterministic and stochastic measures.

Contribution

It introduces a systematic framework using decision-theoretic metrics for analyzing algorithms with learned predictions, applicable to various online decision problems.

Findings

01

Framework quantifies algorithm performance across prediction errors

02

Balances performance and risk in algorithm design

03

Applied to ski-rental, one-max search, and contract scheduling

Abstract

We initiate the systematic study of decision-theoretic metrics in the design and analysis of algorithms with machine-learned predictions. We introduce approaches based on both deterministic measures such as distance-based evaluation, that help us quantify how close the algorithm is to an ideal solution, and stochastic measures that balance the trade-off between the algorithm's performance and the risk associated with the imperfect oracle. These approaches allow us to quantify the algorithm's performance across the full spectrum of the prediction error, and thus choose the best algorithm within an entire class of otherwise incomparable ones. We apply our framework to three well-known problems from online decision making, namely ski-rental, one-max search, and contract scheduling.

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 6Confidence 2

Strengths

Pre-existing work centers on consistency/robustness trade-offs and Pareto-optimality (e.g., caching, one-way trading, search) or on tolerance windows and distributional advice. Comparison of algorithms among those Pareto optimal algorithms is a new thing. The frameworks for distance measures and CVaR-based risk are both nice, actionable, and well-posed. They benchmark on synthetic and real data and report better average ratios or profits than Pareto-optimal (PO) and $\delta$-tolerance baseline

Weaknesses

I actually don't understand the benefit of going beyond Pareto, or why we wouldn't just let practitioners choose one algorithm from the Pareto set. Everyone has different preferences for trading off Consistency and Robustness; given the Pareto-optimal algorithms curve, a practitioner's preferences uniquely pinpoint one algorithm (as in economics class, where the optimal consumption position is where the indifference curve is tangent to the budget line).

Reviewer 02Rating 6Confidence 1

Strengths

-The paper is generally easy to read and follow; it is well motivated and easy to understand even for someone outside the field. -The choice of CVAR is one that is easy to accept, being commonly used in decision theory. - For a theory paper, it is nice to see some experiments, though they seem to be slightly contrived. Disclaimer: I am not from the area and cannot confidently comment on novelty nor quality.

Weaknesses

- No major issues here from me. - Minor criticism : Based on my understanding of the experiments, it seems that the authors are showing that by directly optimizing their metrics, better "results" are obtained based on those very metrics. This is of course unsurprising, so claims like "distance-based algorithms offer considerable improvements over the sota" aren't that fair.

Reviewer 03Rating 6Confidence 2

Strengths

* Systematic approach helps illustrate parallels between related problems, and provides a principled way to choose the "best" algorithm from a set of options. * Risk-oriented analysis is well-motivated by the increasing application of online learning methods in high-stakes applications. * Algorithmic results are evaluated both theoretically and empirically.

Weaknesses

* Scope of novel contributions is unclear. Apart from the value of a unified perspective, it is not completely clear how algorithmic results compare to existing upper and lower bounds known in the literature. * Limitations and practical applicability are not discussed explicitly. * Empirical evaluation in the body of the paper is limited to synthetic data. Appendix D.3 seems to provide some empirical evaluation, but analysis does not seem to be conclusive (i.e., there doesn't seem to be an algor

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsFocus