Coordinated Online Learning for Multi-Agent Systems with Coupled   Constraints and Perturbed Utility Observations

Ezra Tampubolon; Holger Boche

arXiv:2010.10878·math.OC·October 22, 2020

Coordinated Online Learning for Multi-Agent Systems with Coupled Constraints and Perturbed Utility Observations

Ezra Tampubolon, Holger Boche

PDF

TL;DR

This paper introduces a decentralized online learning method for multi-agent systems with coupled resource constraints, ensuring convergence to equilibrium despite noisy utility feedback, applicable to large-scale resource allocation problems.

Contribution

A novel decentralized resource pricing algorithm that guarantees convergence to a generalized Nash equilibrium under noisy feedback conditions.

Findings

01

Almost sure convergence to generalized Nash equilibrium

02

Resource constraints are asymptotically satisfied

03

Finite-time bounds on resource constraint violations

Abstract

Competitive non-cooperative online decision-making agents whose actions increase congestion of scarce resources constitute a model for widespread modern large-scale applications. To ensure sustainable resource behavior, we introduce a novel method to steer the agents toward a stable population state, fulfilling the given coupled resource constraints. The proposed method is a decentralized resource pricing method based on the resource loads resulting from the augmentation of the game's Lagrangian. Assuming that the online learning agents have only noisy first-order utility feedback, we show that for a polynomially decaying agents' step size/learning rate, the population's dynamic will almost surely converge to generalized Nash equilibrium. A particular consequence of the latter is the fulfillment of resource constraints in the asymptotic limit. Moreover, we investigate the finite-time…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.