# Learning Vision-based Flight in Drone Swarms by Imitation

**Authors:** Fabian Schilling, Julien Lecoeur, Fabrizio Schiano, Dario, Floreano

arXiv: 1908.02999 · 2019-08-09

## TL;DR

This paper introduces a novel decentralized drone swarm control method using convolutional neural networks trained on visual data, enabling markerless coordination without communication or shared position data, and successfully transferring from simulation to real-world deployment.

## Contribution

It presents a vision-based, imitation learning approach for decentralized drone swarms that eliminates the need for communication or visual markers, with effective domain adaptation from simulation to real-world.

## Key findings

- Neural network learns robust collision avoidance and cohesion.
- Controller successfully transfers from simulation to real-world.
- Visual localization of agents is effective and interpretable.

## Abstract

Decentralized drone swarms deployed today either rely on sharing of positions among agents or detecting swarm members with the help of visual markers. This work proposes an entirely visual approach to coordinate markerless drone swarms based on imitation learning. Each agent is controlled by a small and efficient convolutional neural network that takes raw omnidirectional images as inputs and predicts 3D velocity commands that match those computed by a flocking algorithm. We start training in simulation and propose a simple yet effective unsupervised domain adaptation approach to transfer the learned controller to the real world. We further train the controller with data collected in our motion capture hall. We show that the convolutional neural network trained on the visual inputs of the drone can learn not only robust inter-agent collision avoidance but also cohesion of the swarm in a sample-efficient manner. The neural controller effectively learns to localize other agents in the visual input, which we show by visualizing the regions with the most influence on the motion of an agent. We remove the dependence on sharing positions among swarm members by taking only local visual information into account for control. Our work can therefore be seen as the first step towards a fully decentralized, vision-based swarm without the need for communication or visual markers.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1908.02999/full.md

## Figures

42 figures with captions in the complete paper: https://tomesphere.com/paper/1908.02999/full.md

## References

27 references — full list in the complete paper: https://tomesphere.com/paper/1908.02999/full.md

---
Source: https://tomesphere.com/paper/1908.02999