Fast computation of mutual information in the frequency domain with   applications to global multimodal image alignment

Johan \"Ofverstedt; Joakim Lindblad; Nata\v{s}a Sladoje

arXiv:2106.14699·cs.CV·July 1, 2022

Fast computation of mutual information in the frequency domain with applications to global multimodal image alignment

Johan \"Ofverstedt, Joakim Lindblad, Nata\v{s}a Sladoje

PDF

1 Repo

TL;DR

This paper introduces a fast, frequency domain-based algorithm for computing mutual information across all displacements, enabling efficient global multimodal image alignment with significant speed improvements over previous methods.

Contribution

The authors develop an asymptotically faster algorithm for mutual information computation in the frequency domain, facilitating global multimodal image alignment.

Findings

01

Achieves high success rates in rigid transformation recovery across diverse datasets.

02

Outperforms existing local and deep learning-based methods in accuracy.

03

GPU implementation yields speed-ups of 100 to over 10,000 times.

Abstract

Multimodal image alignment is the process of finding spatial correspondences between images formed by different imaging techniques or under different conditions, to facilitate heterogeneous data fusion and correlative analysis. The information-theoretic concept of mutual information (MI) is widely used as a similarity measure to guide multimodal alignment processes, where most works have focused on local maximization of MI that typically works well only for small displacements; this points to a need for global maximization of MI, which has previously been computationally infeasible due to the high run-time complexity of existing algorithms. We propose an efficient algorithm for computing MI for all discrete displacements (formalized as the cross-mutual information function (CMIF)), which is based on cross-correlation computed in the frequency domain. We show that the algorithm is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MIDA-group/globalign
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.