Efficiently Combining Human Demonstrations and Interventions for Safe   Training of Autonomous Systems in Real-Time

Vinicius G. Goecks; Gregory M. Gremillion; Vernon J. Lawhern; John; Valasek; Nicholas R. Waytowich

arXiv:1810.11545·cs.AI·November 30, 2018

Efficiently Combining Human Demonstrations and Interventions for Safe Training of Autonomous Systems in Real-Time

Vinicius G. Goecks, Gregory M. Gremillion, Vernon J. Lawhern, John, Valasek, Nicholas R. Waytowich

PDF

1 Repo

TL;DR

This paper presents a framework that combines human demonstrations and interventions to train autonomous systems more efficiently and safely in real-time, demonstrated on a quadrotor in simulation.

Contribution

It introduces a Cycle-of-Learning framework that integrates multiple human interaction modalities to enhance training speed and performance of autonomous agents.

Findings

01

Improved task completion with combined human interaction.

02

Achieved 32% reduction in data needed for training.

03

Enhanced safety and efficiency in real-time autonomous training.

Abstract

This paper investigates how to utilize different forms of human interaction to safely train autonomous systems in real-time by learning from both human demonstrations and interventions. We implement two components of the Cycle-of-Learning for Autonomous Systems, which is our framework for combining multiple modalities of human interaction. The current effort employs human demonstrations to teach a desired behavior via imitation learning, then leverages intervention data to correct for undesired behaviors produced by the imitation learner to teach novel tasks to an autonomous agent safely, after only minutes of training. We demonstrate this method in an autonomous perching task using a quadrotor with continuous roll, pitch, yaw, and throttle commands and imagery captured from a downward-facing camera in a high-fidelity simulated environment. Our method improves task completion…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

viniciusguigo/complete_col
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings