Unconstrained Robust Online Convex Optimization
Jiujia Zhang, Ashok Cutkosky

TL;DR
This paper develops algorithms for online convex optimization that are robust to corrupted gradient feedback in an unconstrained setting, achieving low regret despite arbitrary corruptions.
Contribution
It introduces new algorithms with regret guarantees in unconstrained online learning under arbitrary gradient corruptions, a setting previously lacking robust solutions.
Findings
Regret bound of $ orm{u}G ( oot T + k)$ when $G$ is known.
Additional penalty of $( orm{u}^2 + G^2)k$ when $G$ is unknown.
Algorithms are effective against outliers, mislabeled data, and malicious interference.
Abstract
This paper addresses online learning with ``corrupted'' feedback. Our learner is provided with potentially corrupted gradients instead of the ``true'' gradients . We make no assumptions about how the corruptions arise: they could be the result of outliers, mislabeled data, or even malicious interference. We focus on the difficult ``unconstrained'' setting in which our algorithm must maintain low regret with respect to any comparison point . The unconstrained setting is significantly more challenging as existing algorithms suffer extremely high regret even with very tiny amounts of corruption (which is not true in the case of a bounded domain). Our algorithms guarantee regret when is known, where is a measure of the total amount of corruption. When is unknown we incur an extra additive penalty…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Advanced Wireless Network Optimization · Optimization and Search Problems
MethodsFocus
