Localization: A Missing Link in the Pipeline of Object Matching and Registration
Deepak Mishra, Rajeev Ranjan, Santanu Chaudhury, Mukul Sarkar,, Arvinder Singh Soin

TL;DR
This paper introduces a CNN-based framework for image registration that incorporates object localization, segmentation, and matching, improving accuracy and generalization in real-world scenarios like medical imaging.
Contribution
It presents a novel combination of localization, segmentation, and spatial transformer networks for robust image registration, especially in complex and real-world data.
Findings
Achieved 79% dice coefficient on MNIST dataset.
Achieved 94% dice coefficient on Caltech-101 dataset.
Extended framework successfully registered CT and US images.
Abstract
Image registration is a process of aligning two or more images of same objects using geometric transformation. Most of the existing approaches work on the assumption of location invariance. These approaches require object-centric images to perform matching. Further, in absence of intensity level symmetry between the corresponding points in two images, the learning based registration approaches rely on synthetic deformations, which often fail in real scenarios. To address these issues, a combination of convolutional neural networks (CNNs) to perform the desired registration is developed in this work. The complete objective is divided into three sub-objectives: object localization, segmentation and matching transformation. Object localization step establishes an initial correspondence between the images. A modified version of single shot multi-box detector is used for this purpose. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Medical Image Segmentation Techniques
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Spatial Transformer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam
