Tracking Moving Agents via Inexact Online Gradient Descent Algorithm
Amrit Singh Bedi, Paban Sarma, and Ketan Rajawat

TL;DR
This paper introduces an inexact online gradient descent algorithm for tracking moving agents in multi-agent systems, providing improved regret bounds and demonstrating its effectiveness on tracking and recommendation tasks.
Contribution
It proposes a novel inexact online gradient descent method with enhanced regret analysis for time-varying optimization in multi-agent tracking, accommodating non-strongly convex costs and gradient errors.
Findings
Improved regret rates for non-strongly convex functions.
Effective multi-agent multi-target tracking results.
Versatility demonstrated through online movie recommendation.
Abstract
Multi-agent systems are being increasingly deployed in challenging environments for performing complex tasks such as multi-target tracking, search-and-rescue, and intrusion detection. Notwithstanding the computational limitations of individual robots, such systems rely on collaboration to sense and react to the environment. This paper formulates the generic target tracking problem as a time-varying optimization problem and puts forth an inexact online gradient descent method for solving it sequentially. The performance of the proposed algorithm is studied by characterizing its dynamic regret, a notion common to the online learning literature. Building upon the existing results, we provide improved regret rates that not only allow non-strongly convex costs but also explicating the role of the cumulative gradient error. Two distinct classes of problems are considered: one in which the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Distributed Control Multi-Agent Systems · Stochastic Gradient Optimization Techniques
