A Tensor Network Implementation of Multi Agent Reinforcement Learning
Sunny Howard

TL;DR
This paper introduces a tensor network approach to multi-agent reinforcement learning, enabling efficient representation and optimization of expected returns in multi-agent environments, demonstrated on a two-agent example.
Contribution
It applies tensor network techniques to multi-agent RL, addressing the curse of dimensionality and demonstrating effective optimization and tensor reduction methods.
Findings
Successfully optimized policies using DMRG technique.
Achieved 97.5% reduction in tensor elements without information loss.
Validated approach on a two-agent random walker example.
Abstract
Recently it has been shown that tensor networks (TNs) have the ability to represent the expected return of a single-agent finite Markov decision process (FMDP). The TN represents a distribution model, where all possible trajectories are considered. When extending these ideas to a multi-agent setting, distribution models suffer from the curse of dimensionality: the exponential relation between the number of possible trajectories and the number of agents. The key advantage of using TNs in this setting is that there exists a large number of established optimisation and decomposition techniques that are specific to TNs, that one can apply to ensure the most efficient representation is found. In this report, these methods are used to form a TN that represents the expected return of a multi-agent reinforcement learning (MARL) task. This model is then applied to a 2 agent random walker…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Smart Grid Energy Management
