Partner Approximating Learners (PAL): Simulation-Accelerated Learning   with Explicit Partner Modeling in Multi-Agent Domains

Florian K\"opf; Alexander Nitsch; Michael Flad; S\"oren Hohmann

arXiv:1909.03868·eess.SY·March 3, 2020

Partner Approximating Learners (PAL): Simulation-Accelerated Learning with Explicit Partner Modeling in Multi-Agent Domains

Florian K\"opf, Alexander Nitsch, Michael Flad, S\"oren Hohmann

PDF

TL;DR

This paper introduces Partner Approximating Learners (PAL), a framework that accelerates reinforcement learning in multi-agent settings by modeling partners explicitly and using simulation to reduce real-world data needs.

Contribution

The paper presents a novel framework combining online partner modeling with simulation-accelerated reinforcement learning for improved multi-agent collaboration.

Findings

01

Fast learning due to simulation environment

02

Effective adaptation to changing partners

03

Reduced real-world data requirements

Abstract

Mixed cooperative-competitive control scenarios such as human-machine interaction with individual goals of the interacting partners are very challenging for reinforcement learning agents. In order to contribute towards intuitive human-machine collaboration, we focus on problems in the continuous state and control domain where no explicit communication is considered and the agents do not know the others' goals or control laws but only sense their control inputs retrospectively. Our proposed framework combines a learned partner model based on online data with a reinforcement learning agent that is trained in a simulated environment including the partner model. Thus, we overcome drawbacks of independent learners and, in addition, benefit from a reduced amount of real world data required for reinforcement learning which is vital in the human-machine context. We finally analyze an example…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.