SIPOMDPLite-Net: Lightweight, Self-Interested Learning and Planning in   POSGs with Sparse Interactions

Gengyu Zhang; Prashant Doshi

arXiv:2202.11188·cs.MA·February 24, 2022

SIPOMDPLite-Net: Lightweight, Self-Interested Learning and Planning in POSGs with Sparse Interactions

Gengyu Zhang, Prashant Doshi

PDF

Open Access

TL;DR

sIPOMDPLite-net is a lightweight neural network that enables decentralized, self-interested agents to learn and plan in multiagent environments with sparse interactions, demonstrating good transferability and near-optimal performance.

Contribution

The paper introduces sIPOMDPLite-net, a novel neural network architecture that models self-interested agent planning in POSGs using hierarchical value iteration, with effective transfer to larger and real-world scenarios.

Findings

01

Accurately learns I-POMDP Lite models from expert demonstrations.

02

Performs well on larger grids and real-world maps.

03

Offers a lighter alternative for multiagent planning.

Abstract

This work introduces sIPOMDPLite-net, a deep neural network (DNN) architecture for decentralized, self-interested agent control in partially observable stochastic games (POSGs) with sparse interactions between agents. The network learns to plan in contexts modeled by the interactive partially observable Markov decision process (I-POMDP) Lite framework and uses hierarchical value iteration networks to simulate the solution of nested MDPs, which I-POMDP Lite attributes to the other agent to model its behavior and predict its intention. We train sIPOMDPLite-net with expert demonstrations on small two-agent Tiger-grid tasks, for which it accurately learns the underlying I-POMDP Lite model and near-optimal policy, and the policy continues to perform well on larger grids and real-world maps. As such, sIPOMDPLite-net shows good transfer capabilities and offers a lighter learning and planning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Decision-Making and Behavioral Economics