Tracking Moving Agents via Inexact Online Gradient Descent Algorithm

Amrit Singh Bedi; Paban Sarma; and Ketan Rajawat

arXiv:1710.05133·math.OC·November 30, 2017·IEEE J. Sel. Top. Signal Process.

Tracking Moving Agents via Inexact Online Gradient Descent Algorithm

Amrit Singh Bedi, Paban Sarma, and Ketan Rajawat

PDF

Open Access

TL;DR

This paper introduces an inexact online gradient descent algorithm for tracking moving agents in multi-agent systems, providing improved regret bounds and demonstrating its effectiveness on tracking and recommendation tasks.

Contribution

It proposes a novel inexact online gradient descent method with enhanced regret analysis for time-varying optimization in multi-agent tracking, accommodating non-strongly convex costs and gradient errors.

Findings

01

Improved regret rates for non-strongly convex functions.

02

Effective multi-agent multi-target tracking results.

03

Versatility demonstrated through online movie recommendation.

Abstract

Multi-agent systems are being increasingly deployed in challenging environments for performing complex tasks such as multi-target tracking, search-and-rescue, and intrusion detection. Notwithstanding the computational limitations of individual robots, such systems rely on collaboration to sense and react to the environment. This paper formulates the generic target tracking problem as a time-varying optimization problem and puts forth an inexact online gradient descent method for solving it sequentially. The performance of the proposed algorithm is studied by characterizing its dynamic regret, a notion common to the online learning literature. Building upon the existing results, we provide improved regret rates that not only allow non-strongly convex costs but also explicating the role of the cumulative gradient error. Two distinct classes of problems are considered: one in which the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Distributed Control Multi-Agent Systems · Stochastic Gradient Optimization Techniques