Vision-based deep execution monitoring
Francesco Puja, Simone Grazioso, Antonio Tammaro, Valsmis Ntouskos,, Marta Sanzari, Fiora Pirri

TL;DR
This paper introduces a vision-based execution monitoring system for robots in uncharted environments, utilizing deep learning for object recognition, Bayesian methods for relation discovery, and reinforcement learning for visual search to improve task reliability.
Contribution
It presents an integrated approach combining deep learning, Bayesian inference, and reinforcement learning for robust visual execution monitoring in unstructured environments.
Findings
Effective object recognition with DCNNs.
Successful relation discovery using non-parametric Bayes.
Improved recovery from failures through learned visual search policies.
Abstract
Execution monitor of high-level robot actions can be effectively improved by visual monitoring the state of the world in terms of preconditions and postconditions that hold before and after the execution of an action. Furthermore a policy for searching where to look at, either for verifying the relations that specify the pre and postconditions or to refocus in case of a failure, can tremendously improve the robot execution in an uncharted environment. It is now possible to strongly rely on visual perception in order to make the assumption that the environment is observable, by the amazing results of deep learning. In this work we present visual execution monitoring for a robot executing tasks in an uncharted Lab environment. The execution monitor interacts with the environment via a visual stream that uses two DCNN for recognizing the objects the robot has to deal with and manipulate,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
