Double Deep Q-Learning in Opponent Modeling

Yangtianze Tao; John Doe

arXiv:2211.15384·cs.AI·November 29, 2022

Double Deep Q-Learning in Opponent Modeling

Yangtianze Tao, John Doe

PDF

Open Access

TL;DR

This paper explores using Double Deep Q-Networks with a Mixture-of-Experts architecture for opponent modeling in multi-agent systems, demonstrating improved performance over standard DDQN in simulated environments.

Contribution

It introduces a novel combination of DDQN and Mixture-of-Experts for opponent modeling, enhancing strategy identification in multi-agent scenarios.

Findings

01

Mixture-of-Experts model outperforms DDQN in opponent strategy identification.

02

Opponent modeling improves agent performance in multi-agent environments.

03

The approach effectively captures diverse opponent strategies.

Abstract

Multi-agent systems in which secondary agents with conflicting agendas also alter their methods need opponent modeling. In this study, we simulate the main agent's and secondary agents' tactics using Double Deep Q-Networks (DDQN) with a prioritized experience replay mechanism. Then, under the opponent modeling setup, a Mixture-of-Experts architecture is used to identify various opponent strategy patterns. Finally, we analyze our models in two environments with several agents. The findings indicate that the Mixture-of-Experts model, which is based on opponent modeling, performs better than DDQN.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOpinion Dynamics and Social Influence · Complex Network Analysis Techniques · Network Security and Intrusion Detection

MethodsExperience Replay · Prioritized Experience Replay