Decentralized Online Convex Optimization with Unknown Feedback Delays

Hao Qiu (UNIMI); Mengxiao Zhang; Juliette Achddou (CRIStAL)

arXiv:2601.07901·stat.ML·January 14, 2026

Decentralized Online Convex Optimization with Unknown Feedback Delays

Hao Qiu (UNIMI), Mengxiao Zhang, Juliette Achddou (CRIStAL)

PDF

Open Access 1 Video

TL;DR

This paper introduces a decentralized online convex optimization algorithm that adaptively handles unknown, varying feedback delays among agents, improving regret bounds and performance in networked learning scenarios.

Contribution

It proposes a novel delay-adaptive algorithm with decentralized delay estimation, achieving tighter regret bounds without prior delay knowledge, applicable to both convex and strongly convex settings.

Findings

01

Achieves regret bound of O(N√d_tot + N√T(1-σ2)^{1/4})

02

Extends to strongly convex functions with sharper regret bounds

03

Experimental results show improved performance over benchmarks

Abstract

Decentralized online convex optimization (D-OCO), where multiple agents within a network collaboratively learn optimal decisions in real-time, arises naturally in applications such as federated learning, sensor networks, and multi-agent control. In this paper, we study D-OCO under unknown, time-and agent-varying feedback delays. While recent work has addressed this problem (Nguyen et al., 2024), existing algorithms assume prior knowledge of the total delay over agents and still suffer from suboptimal dependence on both the delay and network parameters. To overcome these limitations, we propose a novel algorithm that achieves an improved regret bound of O N $\sqrt$ d tot + N $\sqrt$ T (1- $σ$ 2) 1/4 , where T is the total horizon, d tot denotes the average total delay across agents, N is the number of agents, and 1 - $σ$ 2 is the spectral gap of the network. Our approach builds…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Decentralized Online Convex Optimization with Unknown Feedback Delays· underline

Taxonomy

TopicsDistributed Control Multi-Agent Systems · Advanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques