CLASP: An online learning algorithm for Convex Losses And Squared Penalties

Ricardo N. Ferreira; Jo\~ao Xavier; Cl\'audia Soares

arXiv:2601.16072·cs.LG·January 27, 2026

CLASP: An online learning algorithm for Convex Losses And Squared Penalties

Ricardo N. Ferreira, Jo\~ao Xavier, Cl\'audia Soares

PDF

Open Access 3 Reviews

TL;DR

This paper introduces CLASP, an online learning algorithm for convex optimization with constraints, achieving optimal regret and penalty bounds, especially logarithmic guarantees in strongly convex scenarios.

Contribution

The paper presents CLASP, a novel algorithm that effectively minimizes loss and constraint violations using a new proof strategy based on convex projectors.

Findings

01

Achieves regret of $O(T^{eta})$ for convex losses.

02

Provides the first logarithmic bounds for strongly convex problems.

03

Ensures bounded cumulative squared penalties with $O( ext{log } T)$.

Abstract

We study Constrained Online Convex Optimization (COCO), where a learner chooses actions iteratively, observes both unanticipated convex loss and convex constraint, and accumulates loss while incurring penalties for constraint violations. We introduce CLASP (Convex Losses And Squared Penalties), an algorithm that minimizes cumulative loss together with squared constraint violations. Our analysis departs from prior work by fully leveraging the firm non-expansiveness of convex projectors, a proof strategy not previously applied in this setting. For convex losses, CLASP achieves regret $O (T^{m a x {β, 1 - β}})$ and cumulative squared penalty $O (T^{1 - β})$ for any $β \in (0, 1)$ . Most importantly, for strongly convex problems, CLASP provides the first logarithmic guarantees on both regret and cumulative squared penalty. In the strongly convex case, the regret…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 4Confidence 3

Strengths

The paper is clearly written and easy to follow (e.g., all assumptions and lemmas are clearly stated). The structure is standard and logical. The logarithmic bound for the strongly convex case is a nice result. The experiments use relevant baselines on synthetic and real-world tasks and the results show that CLASP performs competitively.

Weaknesses

The proposed algorithm is essentially a standard projected gradient method for COCO extended with time-varying constraints: at each round, it performs one gradient step on the loss followed by projection onto the time-varying constraint set. The only new element is the squared penalty measure $CCV_{T,2}$, but it was already analyzed in prior work [Yuan & Lamperski, 2018] for static constraints. The claimed novelty that leveraging FNE in the analysis is mainly a technical proof refinement rather

Reviewer 02Rating 4Confidence 3

Strengths

+ This paper studies constrained online convex optimization (COCO) with adversarial varying constraints, evaluated by static regret and the cumulative squared violation. It provides theoretical guarantees in the both convex and strongly convex regime.

Weaknesses

- The paper does not justify why the squared violation metric is necessary or reasonable in the dynamic setting ($CCV_{T,1}$ seems more reasonable), nor does it provide a clear comparison against the hard violation $CCV_{T,1}$ results. By Cauchy-Schwarz, $CCV_{T,1} \leq \sqrt{T \cdot CCV_{T,2}}$. Hence in the convex setting, the theoretical guarantees on $CCV_{T,2}$ translate to $CCV_{T,1}$ that are strictly weaker than recent bounds (e.g., Sinha and Vaze, 2024). Even under strong convexity, the

Reviewer 03Rating 2Confidence 3

Strengths

This paper is clearly written, with a thorough review and discussion of related work. After reading the entire paper, I have gained a comprehensive understanding of COCO.

Weaknesses

The main weakness of this paper lies in its limited technical novelty. Extending the previous result on squared constraint violations from static to dynamic constraints does not appear to involve substantial technical challenges. In my view, as long as the constraints satisfy Assumption 3, Lemma 3 always holds, making the derivation of the $\text{CCV}_{T,2}$ bound rather straightforward. Therefore, this work seems to be a combination of existing techniques without introducing new technical tools

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Optimization and Search Problems