Self-supervised Monocular Multi-robot Relative Localization with Efficient Deep Neural Networks
Shushuai Li, Christophe De Wagter, Guido C. H. E. de Croon

TL;DR
This paper introduces a self-supervised deep learning framework enabling tiny flying robots to perform monocular relative localization without external groundtruth data, using onboard UWB-based labels for training.
Contribution
The novel framework allows autonomous, scalable, and distributed monocular relative localization for multi-robot systems without manual labeling or external groundtruth.
Findings
Effective localization demonstrated on real quadrotors
Self-supervised training achieves accurate relative positioning
Open-source simulation pipeline supports training and testing
Abstract
Relative localization is an important ability for multiple robots to perform cooperative tasks in GPS-denied environment. This paper presents a novel autonomous positioning framework for monocular relative localization of multiple tiny flying robots. This approach does not require any groundtruth data from external systems or manual labelling. Instead, the proposed framework is able to label real-world images with 3D relative positions between robots based on another onboard relative estimation technology, using ultra-wide band (UWB). After training in this self-supervised manner, the proposed deep neural network (DNN) can predict relative positions of peer robots by purely using a monocular camera. This deep learning-based visual relative localization is scalable, distributed and autonomous. We also built an open-source and light-weight simulation pipeline by using Blender for 3D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques
