Maximum Entropy Dueling Network Architecture in Atari Domain

Alireza Nadali; Mohammad Mehdi Ebadzadeh

arXiv:2107.14457·cs.LG·April 28, 2022

Maximum Entropy Dueling Network Architecture in Atari Domain

Alireza Nadali, Mohammad Mehdi Ebadzadeh

PDF

Open Access

TL;DR

This paper introduces an enhanced deep reinforcement learning architecture for Atari games that combines Dueling Networks with Maximum Entropy principles, leading to improved policy evaluation.

Contribution

It presents a novel architecture integrating Maximum Entropy with Dueling Networks, enhancing value estimation in Atari domain reinforcement learning.

Findings

01

Better policy evaluation performance in Atari games

02

Outperforms original Dueling Networks and other value-based methods

03

Demonstrates the effectiveness of Maximum Entropy integration

Abstract

In recent years, there have been many deep structures for Reinforcement Learning, mainly for value function estimation and representations. These methods achieved great success in Atari 2600 domain. In this paper, we propose an improved architecture based upon Dueling Networks, in this architecture, there are two separate estimators, one approximate the state value function and the other, state advantage function. This improvement based on Maximum Entropy, shows better policy evaluation compared to the original network and other value-based architectures in Atari domain.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsViral Infectious Diseases and Gene Expression in Insects · Reinforcement Learning in Robotics · Evolutionary Algorithms and Applications