Peer Learning: Learning Complex Policies in Groups from Scratch via   Action Recommendations

Cedric Derstroff; Mattia Cerrato; Jannis Brugger; Jan Peters and; Stefan Kramer

arXiv:2312.09950·cs.LG·May 7, 2024·1 cites

Peer Learning: Learning Complex Policies in Groups from Scratch via Action Recommendations

Cedric Derstroff, Mattia Cerrato, Jannis Brugger, Jan Peters and, Stefan Kramer

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces peer learning, a reinforcement learning framework where groups of agents learn complex policies together through limited communication, outperforming traditional single-agent methods and baselines in various domains.

Contribution

It formalizes peer learning as a multi-armed bandit problem for teacher selection and demonstrates its effectiveness in learning complex policies in both discrete and continuous environments.

Findings

01

Peer learning outperforms single-agent and baseline methods.

02

Agents can rank peers' performance and identify reliable advice.

03

Complex policies can evolve from action recommendations beyond discrete actions.

Abstract

Peer learning is a novel high-level reinforcement learning framework for agents learning in groups. While standard reinforcement learning trains an individual agent in trial-and-error fashion, all on its own, peer learning addresses a related setting in which a group of agents, i.e., peers, learns to master a task simultaneously together from scratch. Peers are allowed to communicate only about their own states and actions recommended by others: "What would you do in my situation?". Our motivation is to study the learning behavior of these agents. We formalize the teacher selection process in the action advice setting as a multi-armed bandit problem and therefore highlight the need for exploration. Eventually, we analyze the learning behavior of the peers and observe their ability to rank the agents' performance within the study group and understand which agents give reliable advice.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kramerlab/peerlearning
noneOfficial

Videos

Peer Learning: Learning Complex Policies in Groups from Scratch via Action Recommendations· underline

Taxonomy

TopicsReinforcement Learning in Robotics · Experimental Behavioral Economics Studies · Game Theory and Applications