Discriminative Particle Filter Reinforcement Learning for Complex   Partial Observations

Xiao Ma; Peter Karkus; David Hsu; Wee Sun Lee; Nan Ye

arXiv:2002.09884·cs.LG·February 25, 2020·23 cites

Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations

Xiao Ma, Peter Karkus, David Hsu, Wee Sun Lee, Nan Ye

PDF

Open Access 1 Repo

TL;DR

DPFRL introduces a neural network framework with a discriminative particle filter for improved decision making under complex partial visual observations, outperforming existing models in Atari and real-world navigation tasks.

Contribution

It presents a novel discriminative particle filter integrated into reinforcement learning for complex partial observations, with end-to-end training and new belief features.

Findings

01

Outperforms state-of-the-art POMDP RL models in Flickering Atari Games.

02

Achieves superior results in Natural Flickering Atari Games, a new challenging benchmark.

03

Demonstrates effective visual navigation in real-world Habitat environment.

Abstract

Deep reinforcement learning is successful in decision making for sophisticated games, such as Atari, Go, etc. However, real-world decision making often requires reasoning with partial information extracted from complex visual observations. This paper presents Discriminative Particle Filter Reinforcement Learning (DPFRL), a new reinforcement learning framework for complex partial observations. DPFRL encodes a differentiable particle filter in the neural network policy for explicit reasoning with partial observations over time. The particle filter maintains a belief using learned discriminative update, which is trained end-to-end for decision making. We show that using the discriminative update instead of standard generative models results in significantly improved performance, especially for tasks with complex visual observations, because they circumvent the difficulty of modeling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Yusufma03/DPFRL
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications