Dynamic Deep Factor Graph for Multi-Agent Reinforcement Learning
Yuchen Shi, Shihong Duan, Cheng Xu, Ran Wang, Fangwen Ye, Chau Yuen

TL;DR
This paper presents Dynamic Deep Factor Graphs (DDFG), a novel value decomposition method for multi-agent reinforcement learning that dynamically generates factor graph structures to improve coordination and performance.
Contribution
DDFG introduces a flexible, on-the-fly graph structure generation policy for value decomposition in MARL, enhancing adaptability and efficiency over traditional methods.
Findings
DDFG outperforms existing algorithms in complex tasks.
DDFG effectively balances computational cost and decomposition accuracy.
Empirical validation on predator-prey and StarCraft II tasks demonstrates robustness.
Abstract
This work introduces a novel value decomposition algorithm, termed \textit{Dynamic Deep Factor Graphs} (DDFG). Unlike traditional coordination graphs, DDFG leverages factor graphs to articulate the decomposition of value functions, offering enhanced flexibility and adaptability to complex value function structures. Central to DDFG is a graph structure generation policy that innovatively generates factor graph structures on-the-fly, effectively addressing the dynamic collaboration requirements among agents. DDFG strikes an optimal balance between the computational overhead associated with aggregating value functions and the performance degradation inherent in their complete decomposition. Through the application of the max-sum algorithm, DDFG efficiently identifies optimal policies. We empirically validate DDFG's efficacy in complex scenarios, including higher-order predator-prey tasks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
