Hierarchical RNNs-Based Transformers MADDPG for Mixed Cooperative-Competitive Environments
Xiaolong Wei, LiFang Yang, Xianglin Huang, Gang Cao, Tao Zhulin,, Zhengyang Du, Jing An

TL;DR
This paper introduces HRTMADDPG, a hierarchical transformer-based multi-agent reinforcement learning method that combines RNNs and transformers to better capture temporal and inter-sequence relationships in cooperative-competitive environments.
Contribution
It proposes a novel hierarchical model integrating RNNs and transformers for multi-agent reinforcement learning, enhancing the understanding of temporal and inter-sequence dependencies.
Findings
Improved learning efficiency in multi-agent environments.
Effective encoding of multi-step temporal information.
Enhanced correlation learning between sequences.
Abstract
At present, attention mechanism has been widely applied to the fields of deep learning models. Structural models that based on attention mechanism can not only record the relationships between features position, but also can measure the importance of different features based on their weights. By establishing dynamically weighted parameters for choosing relevant and irrelevant features, the key information can be strengthened, and the irrelevant information can be weakened. Therefore, the efficiency of deep learning algorithms can be significantly elevated and improved. Although transformers have been performed very well in many fields including reinforcement learning, there are still many problems and applications can be solved and made with transformers within this area. MARL (known as Multi-Agent Reinforcement Learning) can be recognized as a set of independent agents trying to adapt…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Neural Networks and Applications
MethodsConvolution · Experience Replay · Dense Connections · Adam · Batch Normalization · Weight Decay · *Communicated@Fast*How Do I Communicate to Expedia? · MADDPG
