Risk-Averse Learning with Varying Risk Levels

Siyi Wang; Zifan Wang; Karl H. Johansson

arXiv:2512.22986·math.OC·December 30, 2025

Risk-Averse Learning with Varying Risk Levels

Siyi Wang, Zifan Wang, Karl H. Johansson

PDF

Open Access

TL;DR

This paper develops risk-averse online optimization algorithms that adapt to changing risk levels in dynamic environments, using CVaR as the risk measure, with theoretical regret bounds and numerical validation.

Contribution

It introduces a novel risk-level variation metric and provides algorithms with regret analysis for both first- and zeroth-order settings in non-stationary environments.

Findings

01

Algorithms achieve sublinear regret bounds.

02

Methods adapt effectively to risk level changes.

03

Numerical experiments confirm theoretical results.

Abstract

In safety-critical decision-making, the environment may evolve over time, and the learner adjusts its risk level accordingly. This work investigates risk-averse online optimization in dynamic environments with varying risk levels, employing Conditional Value-at-Risk (CVaR) as the risk measure. To capture the dynamics of the environment and risk levels, we employ the function variation metric and introduce a novel risk-level variation metric. Two information settings are considered: a first-order scenario, where the learner observes both function values and their gradients; and a zeroth-order scenario, where only function evaluations are available. For both cases, we develop risk-averse learning algorithms with a limited sampling budget and analyze their dynamic regret bounds in terms of function variation, risk-level variation, and the total number of samples. The regret analysis…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Risk and Portfolio Optimization