Learning Camera Performance Models for Active Multi-Camera Visual Teach and Repeat
Mat\'ias Mattamala, Milad Ramezani, Marco Camurri, Maurice Fallon

TL;DR
This paper presents a robust multi-camera Visual Teach and Repeat system that actively selects the most informative camera stream for localization, enabling reliable navigation in dynamic, cluttered environments and on a quadruped robot.
Contribution
Introduction of Camera Performance Models for non-synchronized multi-camera VT&R, improving robustness and adaptability in challenging environments.
Findings
Successful localization despite occlusions and environmental changes
Achieved less than 10cm tracking precision in experiments
Generalization to four-camera setups in simulation
Abstract
In dynamic and cramped industrial environments, achieving reliable Visual Teach and Repeat (VT&R) with a single-camera is challenging. In this work, we develop a robust method for non-synchronized multi-camera VT&R. Our contribution are expected Camera Performance Models (CPM) which evaluate the camera streams from the teach step to determine the most informative one for localization during the repeat step. By actively selecting the most suitable camera for localization, we are able to successfully complete missions when one of the cameras is occluded, faces into feature poor locations or if the environment has changed. Furthermore, we explore the specific challenges of achieving VT&R on a dynamic quadruped robot, ANYmal. The camera does not follow a linear path (due to the walking gait and holonomicity) such that precise path-following cannot be achieved. Our experiments feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
