Pathfinding in Random Partially Observable Environments with   Vision-Informed Deep Reinforcement Learning

Anthony Dowling

arXiv:2209.04801·cs.LG·September 13, 2022·1 cites

Pathfinding in Random Partially Observable Environments with Vision-Informed Deep Reinforcement Learning

Anthony Dowling

PDF

Open Access

TL;DR

This paper demonstrates that a Deep Q-Network (DQN) can effectively perform pathfinding in partially observable environments using visual inputs, outperforming recurrent models like DQN-GRU and DQN-LSTM under similar conditions.

Contribution

It introduces a comparison of DQN, DQN-GRU, and DQN-LSTM for visual pathfinding in partially observable environments, showing DQN's superior performance.

Findings

01

DQN outperforms recurrent models in pathfinding tasks.

02

Visual input-based DQN can operate effectively in partial observability.

03

Recurrent models do not necessarily improve performance in this setting.

Abstract

Deep reinforcement learning is a technique for solving problems in a variety of environments, ranging from Atari video games to stock trading. This method leverages deep neural network models to make decisions based on observations of a given environment with the goal of maximizing a reward function that can incorporate cost and rewards for reaching goals. With the aim of pathfinding, reward conditions can include reaching a specified target area along with costs for movement. In this work, multiple Deep Q-Network (DQN) agents are trained to operate in a partially observable environment with the goal of reaching a target zone in minimal travel time. The agent operates based on a visual representation of its surroundings, and thus has a restricted capability to observe the environment. A comparison between DQN, DQN-GRU, and DQN-LSTM is performed to examine each models capabilities with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games

MethodsEmirates Airlines Office in Dubai · Q-Learning · Dense Connections · Convolution · Deep Q-Network