Deep Implicit Optimization enables Robust Learnable Features for   Deformable Image Registration

Rohit Jena; Pratik Chaudhari; James C. Gee

arXiv:2406.07361·cs.CV·May 6, 2025

Deep Implicit Optimization enables Robust Learnable Features for Deformable Image Registration

Rohit Jena, Pratik Chaudhari, James C. Gee

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel deep learning framework that integrates explicit optimization layers to improve deformable image registration, achieving robustness to domain shifts and enabling flexible transformation representations without retraining.

Contribution

It proposes a method that combines deep feature learning with implicit differentiation through an optimization solver, enhancing registration accuracy and flexibility over prior DLIR approaches.

Findings

01

Excellent in-domain registration performance

02

Robust to domain shifts like anisotropy and intensity variations

03

Allows switching transformation models at test time without retraining

Abstract

Deep Learning in Image Registration (DLIR) methods have been tremendously successful in image registration due to their speed and ability to incorporate weak label supervision at training time. However, existing DLIR methods forego many of the benefits and invariances of optimization methods. The lack of a task-specific inductive bias in DLIR methods leads to suboptimal performance, especially in the presence of domain shift. Our method aims to bridge this gap between statistical learning and optimization by explicitly incorporating optimization as a layer in a deep network. A deep network is trained to predict multi-scale dense feature images that are registered using a black box iterative optimization solver. This optimal warp is then used to minimize image and label alignment errors. By implicitly differentiating end-to-end through an iterative optimization solver, we explicitly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rohitrango/dio
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMedical Image Segmentation Techniques · Robotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings