How hard is it to cross the room? -- Training (Recurrent) Neural   Networks to steer a UAV

Klaas Kelchtermans; Tinne Tuytelaars

arXiv:1702.07600·cs.CV·February 27, 2017·30 cites

How hard is it to cross the room? -- Training (Recurrent) Neural Networks to steer a UAV

Klaas Kelchtermans, Tinne Tuytelaars

PDF

Open Access

TL;DR

This paper investigates training recurrent neural networks, specifically LSTM, for UAV navigation using vision, proposing methods to improve training and demonstrating successful crossing of a room with obstacles in simulation.

Contribution

It introduces the use of RNNs for UAV control, proposes WW-TBPTT for training, and compares partial versus end-to-end training approaches for navigation tasks.

Findings

01

LSTM networks can be successfully trained for UAV navigation.

02

Window-wise truncated backpropagation through time improves training stability.

03

Partial retraining of control layers is effective for navigation tasks.

Abstract

This work explores the feasibility of steering a drone with a (recurrent) neural network, based on input from a forward looking camera, in the context of a high-level navigation task. We set up a generic framework for training a network to perform navigation tasks based on imitation learning. It can be applied to both aerial and land vehicles. As a proof of concept we apply it to a UAV (Unmanned Aerial Vehicle) in a simulated environment, learning to cross a room containing a number of obstacles. So far only feedforward neural networks (FNNs) have been used to train UAV control. To cope with more complex tasks, we propose the use of recurrent neural networks (RNN) instead and successfully train an LSTM (Long-Short Term Memory) network for controlling UAVs. Vision based control is a sequential prediction problem, known for its highly correlated input data. The correlation makes training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Robotic Path Planning Algorithms

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory