Non-Linear Coordination Graphs
Yipeng Kang, Tonghan Wang, Xiaoran Wu, Qianlan Yang, Chongjie Zhang

TL;DR
This paper introduces a novel non-linear coordination graph approach for multi-agent reinforcement learning, enhancing the representational capacity beyond linear decompositions and demonstrating superior performance on complex tasks.
Contribution
It extends coordination graphs to a non-linear setting using mixing networks with LeakyReLU, addressing greedy action selection challenges with new optimization methods.
Findings
Achieves better performance on multi-agent coordination tasks like MACO.
Proposes an enumeration method with global optimality guarantees.
Develops an efficient iterative optimization with local optimality guarantees.
Abstract
Value decomposition multi-agent reinforcement learning methods learn the global value function as a mixing of each agent's individual utility functions. Coordination graphs (CGs) represent a higher-order decomposition by incorporating pairwise payoff functions and thus is supposed to have a more powerful representational capacity. However, CGs decompose the global value function linearly over local value functions, severely limiting the complexity of the value function class that can be represented. In this paper, we propose the first non-linear coordination graph by extending CG value decomposition beyond the linear case. One major challenge is to conduct greedy action selections in this new function class to which commonly adopted DCOP algorithms are no longer applicable. We study how to solve this problem when mixing networks with LeakyReLU activation are used. An enumeration method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics
