H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and   Stereo Semantic Segmentation in Intracardiac Catheters

Pedram Fekri; Mehrdad Zadeh; Javad Dargahi

arXiv:2501.00514·eess.IV·January 3, 2025

H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and Stereo Semantic Segmentation in Intracardiac Catheters

Pedram Fekri, Mehrdad Zadeh, Javad Dargahi

PDF

TL;DR

This paper introduces H-Net, a lightweight multitask neural network that simultaneously performs 3D force estimation and stereo semantic segmentation of intracardiac catheters from biplane X-ray images, advancing real-time catheter navigation.

Contribution

It presents the first integrated architecture capable of concurrent catheter segmentation from two views and 3D force estimation, optimized for limited computational resources.

Findings

01

Achieved state-of-the-art performance in segmentation accuracy.

02

Demonstrated precise 3D force estimation from stereo images.

03

Validated effectiveness on intracardiac catheter datasets.

Abstract

The success rate of catheterization procedures is closely linked to the sensory data provided to the surgeon. Vision-based deep learning models can deliver both tactile and visual information in a sensor-free manner, while also being cost-effective to produce. Given the complexity of these models for devices with limited computational resources, research has focused on force estimation and catheter segmentation separately. However, there is a lack of a comprehensive architecture capable of simultaneously segmenting the catheter from two different angles and estimating the applied forces in 3D. To bridge this gap, this work proposes a novel, lightweight, multi-input, multi-output encoder-decoder-based architecture. It is designed to segment the catheter from two points of view and concurrently measure the applied forces in the x, y, and z directions. This network processes two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.