Unsupervised Video Object Segmentation for Deep Reinforcement Learning
Vik Goel, Jameson Weng, Pascal Poupart

TL;DR
This paper introduces MOREL, an unsupervised method that detects and segments moving objects in video to improve deep reinforcement learning policies, making them more interpretable and efficient.
Contribution
The paper presents a novel unsupervised approach for object detection in reinforcement learning, leveraging motion cues to enhance policy learning and interpretability.
Findings
Reduced environment interactions needed for good policies.
Enhanced interpretability of learned policies.
Effective detection of relevant moving objects in Atari games.
Abstract
We present a new technique for deep reinforcement learning that automatically detects moving objects and uses the relevant information for action selection. The detection of moving objects is done in an unsupervised way by exploiting structure from motion. Instead of directly learning a policy from raw images, the agent first learns to detect and segment moving objects by exploiting flow information in video sequences. The learned representation is then used to focus the policy of the agent on the moving objects. Over time, the agent identifies which objects are critical for decision making and gradually builds a policy based on relevant moving objects. This approach, which we call Motion-Oriented REinforcement Learning (MOREL), is demonstrated on a suite of Atari games where the ability to detect moving objects reduces the amount of interaction needed with the environment to obtain a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Human Pose and Action Recognition · Anomaly Detection Techniques and Applications
