Visual search and recognition for robot task execution and monitoring
Lorenzo Mauro, Francesco Puja, Simone Grazioso, Valsamis Ntouskos,, Marta Sanzari, Edoardo Alati, Fiora Pirri

TL;DR
This paper introduces a vision-based execution monitoring framework for robots that combines deep reinforcement learning and convolutional networks to improve task execution and failure recovery.
Contribution
It presents a novel framework integrating deep learning and classical planning for robot task monitoring and failure recovery.
Findings
Robot can complete simple tasks autonomously.
Framework enables recovery from failures.
Effective in visual search and environment understanding.
Abstract
Visual search of relevant targets in the environment is a crucial robot skill. We propose a preliminary framework for the execution monitor of a robot task, taking care of the robot attitude to visually searching the environment for targets involved in the task. Visual search is also relevant to recover from a failure. The framework exploits deep reinforcement learning to acquire a "common sense" scene structure and it takes advantage of a deep convolutional network to detect objects and relevant relations holding between them. The framework builds on these methods to introduce a vision-based execution monitoring, which uses classical planning as a backbone for task execution. Experiments show that with the proposed vision-based execution monitor the robot can complete simple tasks and can recover from failures in autonomy.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms
