VesNet-RL: Simulation-based Reinforcement Learning for Real-World US Probe Navigation
Yuan Bi, Zhongliang Jiang, Yuan Gao, Thomas Wendler, Angelos Karlas,, and Nassir Navab

TL;DR
This paper introduces VesNet-RL, a simulation-based reinforcement learning framework for real-world ultrasound probe navigation, achieving accurate vessel view alignment without extensive real-world training.
Contribution
It presents a novel simulation-based RL method with a multi-modality state representation and a vessel view recognition approach for ultrasound probe navigation.
Findings
Effective virtual validation on 3D carotid artery volumes
Successful physical validation on gel phantoms
Accurate navigation to vessel longitudinal views
Abstract
Ultrasound (US) is one of the most common medical imaging modalities since it is radiation-free, low-cost, and real-time. In freehand US examinations, sonographers often navigate a US probe to visualize standard examination planes with rich diagnostic information. However, reproducibility and stability of the resulting images often suffer from intra- and inter-operator variation. Reinforcement learning (RL), as an interaction-based learning method, has demonstrated its effectiveness in visual navigating tasks; however, RL is limited in terms of generalization. To address this challenge, we propose a simulation-based RL framework for real-world navigation of US probes towards the standard longitudinal views of vessels. A UNet is used to provide binary masks from US images; thereby, the RL agent trained on simulated binary vessel images can be applied in real scenarios without further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Multimodal Machine Learning Applications · Mobile Crowdsensing and Crowdsourcing
