RGB-Multispectral Matching: Dataset, Learning Methodology, Evaluation

Fabio Tosi; Pierluigi Zama Ramirez; Matteo Poggi; Samuele Salti,; Stefano Mattoccia; Luigi Di Stefano

arXiv:2206.07047·cs.CV·June 15, 2022·1 cites

RGB-Multispectral Matching: Dataset, Learning Methodology, Evaluation

Fabio Tosi, Pierluigi Zama Ramirez, Matteo Poggi, Samuele Salti,, Stefano Mattoccia, Luigi Di Stefano

PDF

Open Access

TL;DR

This paper introduces a new dataset and a deep learning approach for registering RGB and multi-spectral images with different resolutions, achieving promising accuracy in a challenging cross-modal matching task.

Contribution

The paper presents a novel RGB-MS dataset with high-resolution disparity ground-truth and a self-supervised learning method leveraging RGB-RGB matching to improve cross-modal registration.

Findings

01

Achieved 1.16 pixels average registration error

02

Created a dataset with 34 annotated image pairs in indoor scenes

03

Demonstrated effective self-supervised training without ground-truth labels

Abstract

We address the problem of registering synchronized color (RGB) and multi-spectral (MS) images featuring very different resolution by solving stereo matching correspondences. Purposely, we introduce a novel RGB-MS dataset framing 13 different scenes in indoor environments and providing a total of 34 image pairs annotated with semi-dense, high-resolution ground-truth labels in the form of disparity maps. To tackle the task, we propose a deep learning architecture trained in a self-supervised manner by exploiting a further RGB camera, required only during training data acquisition. In this setup, we can conveniently learn cross-modal matching in the absence of ground-truth labels by distilling knowledge from an easier RGB-RGB matching task based on a collection of about 11K unlabeled image triplets. Experiments show that the proposed pipeline sets a good performance bar (1.16 pixels…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization