How hard is it to cross the room? -- Training (Recurrent) Neural Networks to steer a UAV
Klaas Kelchtermans, Tinne Tuytelaars

TL;DR
This paper investigates training recurrent neural networks, specifically LSTM, for UAV navigation using vision, proposing methods to improve training and demonstrating successful crossing of a room with obstacles in simulation.
Contribution
It introduces the use of RNNs for UAV control, proposes WW-TBPTT for training, and compares partial versus end-to-end training approaches for navigation tasks.
Findings
LSTM networks can be successfully trained for UAV navigation.
Window-wise truncated backpropagation through time improves training stability.
Partial retraining of control layers is effective for navigation tasks.
Abstract
This work explores the feasibility of steering a drone with a (recurrent) neural network, based on input from a forward looking camera, in the context of a high-level navigation task. We set up a generic framework for training a network to perform navigation tasks based on imitation learning. It can be applied to both aerial and land vehicles. As a proof of concept we apply it to a UAV (Unmanned Aerial Vehicle) in a simulated environment, learning to cross a room containing a number of obstacles. So far only feedforward neural networks (FNNs) have been used to train UAV control. To cope with more complex tasks, we propose the use of recurrent neural networks (RNN) instead and successfully train an LSTM (Long-Short Term Memory) network for controlling UAVs. Vision based control is a sequential prediction problem, known for its highly correlated input data. The correlation makes training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Robotic Path Planning Algorithms
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
