PLOTS: Procedure Learning from Observations using Subtask Structure
Tong Mu, Karan Goel, Emma Brunskill

TL;DR
This paper introduces PLOTS, a method for procedural learning from observation that efficiently constructs open-loop action plans, leveraging repeated subsequences and optimistic exploration to outperform existing approaches in speed.
Contribution
The paper presents a novel approach for procedural learning from observation that is significantly faster than policy-gradient and model-based methods, especially in structured environments.
Findings
PLOTS is about 100 times faster than policy-gradient approaches.
Explicit procedural learning improves speed over traditional methods.
Optimistic action selection enhances performance in environments with latent structure.
Abstract
In many cases an intelligent agent may want to learn how to mimic a single observed demonstrated trajectory. In this work we consider how to perform such procedural learning from observation, which could help to enable agents to better use the enormous set of video data on observation sequences. Our approach exploits the properties of this setting to incrementally build an open loop action plan that can yield the desired subsequence, and can be used in both Markov and partially observable Markov domains. In addition, procedures commonly involve repeated extended temporal action subsequences. Our method optimistically explores actions to leverage potential repeated structure in the procedure. In comparing to some state-of-the-art approaches we find that our explicit procedural learning from observation method is about 100 times faster than policy-gradient based approaches that learn a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Machine Learning and Algorithms · Machine Learning and Data Classification
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
