Warm-Started Reinforcement Learning for Iterative 3D/2D Liver Registration

Hanyuan Zhang; Lucas He; Zijie Cheng; Abdolrahim Kadkhodamohammadi; Danail Stoyanov; Brian R. Davidson; Evangelos B. Mazomenos; Matthew.J Clarkson

arXiv:2604.10245·cs.CV·May 21, 2026

Warm-Started Reinforcement Learning for Iterative 3D/2D Liver Registration

Hanyuan Zhang, Lucas He, Zijie Cheng, Abdolrahim Kadkhodamohammadi, Danail Stoyanov, Brian R. Davidson, Evangelos B. Mazomenos, Matthew.J Clarkson

PDF

TL;DR

This paper introduces a reinforcement learning framework for efficient, automated 3D/2D liver registration in surgical AR, achieving comparable accuracy to traditional methods with faster convergence.

Contribution

It presents a warm-started RL approach that formulates registration as a sequential decision process, improving speed and automation over existing supervised methods.

Findings

01

Achieved an average TRE of 15.70 mm on a public dataset.

02

Faster convergence compared to supervised optimization-based methods.

03

Automated registration without manual tuning of step sizes or stopping criteria.

Abstract

Registration between preoperative CT and intraoperative laparoscopic video plays a crucial role in augmented reality (AR) guidance for minimally invasive surgery. Learning-based methods have recently achieved registration errors comparable to optimization-based approaches while offering faster inference. However, many supervised methods produce coarse alignments that rely on additional optimization-based refinement, thereby increasing inference time. We present a discrete-action reinforcement learning (RL) framework that formulates CT-to-video registration as a sequential decision-making process. A shared feature encoder, warm-started from a supervised pose estimation network to provide stable geometric features and faster convergence, extracts representations from CT renderings and laparoscopic frames, while an RL policy head learns to choose rigid transformations along six degrees…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.