Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations
Xiao Ma, Peter Karkus, David Hsu, Wee Sun Lee, Nan Ye

TL;DR
DPFRL introduces a neural network framework with a discriminative particle filter for improved decision making under complex partial visual observations, outperforming existing models in Atari and real-world navigation tasks.
Contribution
It presents a novel discriminative particle filter integrated into reinforcement learning for complex partial observations, with end-to-end training and new belief features.
Findings
Outperforms state-of-the-art POMDP RL models in Flickering Atari Games.
Achieves superior results in Natural Flickering Atari Games, a new challenging benchmark.
Demonstrates effective visual navigation in real-world Habitat environment.
Abstract
Deep reinforcement learning is successful in decision making for sophisticated games, such as Atari, Go, etc. However, real-world decision making often requires reasoning with partial information extracted from complex visual observations. This paper presents Discriminative Particle Filter Reinforcement Learning (DPFRL), a new reinforcement learning framework for complex partial observations. DPFRL encodes a differentiable particle filter in the neural network policy for explicit reasoning with partial observations over time. The particle filter maintains a belief using learned discriminative update, which is trained end-to-end for decision making. We show that using the discriminative update instead of standard generative models results in significantly improved performance, especially for tasks with complex visual observations, because they circumvent the difficulty of modeling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
