Autonomous navigation of catheters and guidewires in mechanical thrombectomy using inverse reinforcement learning
Harry Robertshaw, Lennart Karstensen, Benjamin Jackson, Alejandro, Granados, Thomas C. Booth

TL;DR
This paper demonstrates the feasibility of autonomous catheter and guidewire navigation in mechanical thrombectomy using inverse reinforcement learning, achieving high success rates and emphasizing reward shaping for optimal performance.
Contribution
It introduces a novel application of inverse reinforcement learning for autonomous endovascular navigation, leveraging expert demonstrations to improve procedure success.
Findings
High success rates of 95-96% in navigation tasks.
Reward shaping significantly improved procedure time and success.
IRL-derived reward functions underperformed compared to reward shaping.
Abstract
Purpose: Autonomous navigation of catheters and guidewires can enhance endovascular surgery safety and efficacy, reducing procedure times and operator radiation exposure. Integrating tele-operated robotics could widen access to time-sensitive emergency procedures like mechanical thrombectomy (MT). Reinforcement learning (RL) shows potential in endovascular navigation, yet its application encounters challenges without a reward signal. This study explores the viability of autonomous navigation in MT vasculature using inverse RL (IRL) to leverage expert demonstrations. Methods: This study established a simulation-based training and evaluation environment for MT navigation. We used IRL to infer reward functions from expert behaviour when navigating a guidewire and catheter. We utilized soft actor-critic to train models with various reward functions and compared their performance in silico.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
