Deep Learning of Robotic Tasks without a Simulator using Strong and Weak   Human Supervision

Bar Hilleli; Ran El-Yaniv

arXiv:1612.01086·cs.AI·March 28, 2017·2 cites

Deep Learning of Robotic Tasks without a Simulator using Strong and Weak Human Supervision

Bar Hilleli, Ran El-Yaniv

PDF

Open Access

TL;DR

This paper introduces a comprehensive deep learning framework for training autonomous highway steering agents directly from human supervision without relying on a simulator, emphasizing safety and multi-stage learning.

Contribution

It presents a novel multi-stage training scheme combining unsupervised, supervised, and reinforcement learning, including a safety network to prevent catastrophic errors.

Findings

01

Successfully trained an autonomous highway steering agent without a simulator

02

Demonstrated the importance of the last four training elements for vision-based control

03

Introduced a safety network to prevent catastrophic mistakes during reinforcement learning

Abstract

We propose a scheme for training a computerized agent to perform complex human tasks such as highway steering. The scheme is designed to follow a natural learning process whereby a human instructor teaches a computerized trainee. The learning process consists of five elements: (i) unsupervised feature learning; (ii) supervised imitation learning; (iii) supervised reward induction; (iv) supervised safety module construction; and (v) reinforcement learning. We implemented the last four elements of the scheme using deep convolutional networks and applied it to successfully create a computerized agent capable of autonomous highway steering over the well-known racing game Assetto Corsa. We demonstrate that the use of the last four elements is essential to effectively carry out the steering task using vision alone, without access to a driving simulator internals, and operating in wall-clock…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Advanced Neural Network Applications