An active learning method for solving competitive multi-agent decision-making and control problems

Filippo Fabiani; Alberto Bemporad

arXiv:2212.12561·eess.SY·October 10, 2024

Open Access, 1 repository

TL;DR

This paper presents an active learning approach for identifying stationary action profiles in competitive multi-agent systems by probing agents' reactions and updating local estimates, with theoretical guarantees and numerical validation.

Contribution

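The paper introduces an active-learning scheme that does not assume a stationary action profile exists. It gives sufficient conditions under which convergence of the learned parameters yields such a profile, so the same conditions also certify that one exists.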

Findings

01. The method identifies stationary action profiles in competitive multi-agent decision-making and control problems.

02. Sufficient conditions ensure that, if the parameters of the estimated action-reaction mappings converge, the resulting action profile is stationary.

03. Numerical simulations demonstrate practical applicability and robustness.

Abstract

To identify a stationary action profile for a population of competitive agents, each executing private strategies, we introduce a novel active-learning scheme where a centralized external observer (or entity) can probe the agents' reactions and recursively update simple local parametric estimates of the action-reaction mappings. Under very general working assumptions (not even assuming that a stationary profile exists), sufficient conditions are established to assess the asymptotic properties of the proposed active learning methodology so that, if the parameters characterizing the action-reaction mappings converge, a stationary action profile is achieved. Such conditions hence act also as certificates for the existence of such a profile. Extensive numerical simulations involving typical competitive multi-agent control and decision-making problems illustrate the practical effectiveness…
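The probe-and-update loop described in the abstract can be illustrated with a toy sketch. It is not the authors' algorithm and does not use the gnep-learn API. Everything in it is an assumption made for illustration: two agents with scalar actions, hidden affine reaction maps (`react_1`, `react_2`), affine local models fitted by recursive least squares (`AffineEstimate`), and an observer that proposes the fixed point of its current estimates as the next probe.

```python
# Toy sketch only: the reaction maps, the affine model class, and all names
# below are hypothetical; none of them come from the paper or from gnep-learn.

def react_1(x2):
    """Agent 1's private reaction to agent 2's action (hidden from the observer)."""
    return 0.5 * x2 + 1.0


def react_2(x1):
    """Agent 2's private reaction to agent 1's action (hidden from the observer)."""
    return -0.3 * x1 + 2.0


class AffineEstimate:
    """Local model r(x) ~ a*x + b, updated by recursive least squares."""

    def __init__(self, p0=1e6):
        self.theta = [0.0, 0.0]               # parameters [a, b]
        self.P = [[p0, 0.0], [0.0, p0]]       # large initial covariance

    def update(self, x, y):
        phi = (x, 1.0)                        # regressor for an affine model
        P = self.P
        Pphi = [P[0][0] * phi[0] + P[0][1] * phi[1],
                P[1][0] * phi[0] + P[1][1] * phi[1]]
        denom = 1.0 + phi[0] * Pphi[0] + phi[1] * Pphi[1]
        gain = [Pphi[0] / denom, Pphi[1] / denom]
        err = y - (self.theta[0] * phi[0] + self.theta[1] * phi[1])
        for i in range(2):
            self.theta[i] += gain[i] * err
            for j in range(2):
                P[i][j] -= gain[i] * Pphi[j]  # P <- P - gain * (P phi)^T


def estimated_fixed_point(est1, est2, fallback):
    """Solve x1 = a1*x2 + b1, x2 = a2*x1 + b2 for the estimated models."""
    a1, b1 = est1.theta
    a2, b2 = est2.theta
    det = 1.0 - a1 * a2
    if abs(det) < 1e-9:                       # estimated maps have no unique fixed point
        return fallback
    x1 = (a1 * b2 + b1) / det
    return (x1, a2 * x1 + b2)


def learn_stationary_profile(iters=20):
    est1, est2 = AffineEstimate(), AffineEstimate()
    x = (0.0, 0.0)                            # initial probe
    for _ in range(iters):
        # The observer proposes x and records each agent's reaction.
        y1, y2 = react_1(x[1]), react_2(x[0])
        est1.update(x[1], y1)
        est2.update(x[0], y2)
        # Next probe: the profile that is stationary for the current estimates.
        x = estimated_fixed_point(est1, est2, fallback=x)
    return x, (est1.theta, est2.theta)


if __name__ == "__main__":
    x, _ = learn_stationary_profile()
    print("profile:", x)
    print("residuals:", x[0] - react_1(x[1]), x[1] - react_2(x[0]))
```

For these assumed reaction maps the exact stationary profile is x1 = 2/1.15, x2 = 1.7/1.15, so the printed residuals should be close to zero. The sketch mirrors the abstract's logic only in spirit: once the estimated parameters stop changing, the proposed profile reproduces the agents' own reactions, which is what makes it stationary.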

Code & Models

Repositories

bemporad/gnep-learn (JAX, official)

Taxonomy

Topics: Auction Theory and Applications