Non-Stationary Policy Learning for Multi-Timescale Multi-Agent   Reinforcement Learning

Patrick Emami; Xiangyu Zhang; David Biagioni; Ahmed S. Zamzam

arXiv:2307.08794·cs.LG·July 19, 2023

Non-Stationary Policy Learning for Multi-Timescale Multi-Agent Reinforcement Learning

Patrick Emami, Xiangyu Zhang, David Biagioni, Ahmed S. Zamzam

PDF

Open Access

TL;DR

This paper introduces a simple framework for learning non-stationary, multi-timescale policies in multi-agent reinforcement learning using periodic time encoding and phase-functioned neural networks, validated on gridworld and energy management tasks.

Contribution

It proposes a novel approach leveraging periodic encoding and phase-functioned neural networks to effectively learn non-stationary policies in multi-timescale MARL.

Findings

01

Successfully learned policies in gridworld environment.

02

Effective energy management policy in building environment.

03

Demonstrated theoretical learnability of non-stationary policies.

Abstract

In multi-timescale multi-agent reinforcement learning (MARL), agents interact across different timescales. In general, policies for time-dependent behaviors, such as those induced by multiple timescales, are non-stationary. Learning non-stationary policies is challenging and typically requires sophisticated or inefficient algorithms. Motivated by the prevalence of this control problem in real-world complex systems, we introduce a simple framework for learning non-stationary policies for multi-timescale MARL. Our approach uses available information about agent timescales to define a periodic time encoding. In detail, we theoretically demonstrate that the effects of non-stationarity introduced by multiple timescales can be learned by a periodic multi-agent policy. To learn such policies, we propose a policy gradient algorithm that parameterizes the actor and critic with phase-functioned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics