From Scanning Guidelines to Action: A Robotic Ultrasound Agent with LLM-Based Reasoning
Yuan Bi, Yiping Zhou, Pei Liu, Feng Li, Zhongliang Jiang, Nassir Navab

TL;DR
This paper introduces a robotic ultrasound system that uses a large language model to interpret guidelines and adapt its scanning strategy dynamically, improving flexibility and reliability over fixed procedures.
Contribution
A novel LLM-based framework for autonomous robotic ultrasound that interprets guidelines, reasons over steps, and adapts scanning strategies in real-time, with RL fine-tuning for improved decision-making.
Findings
Successfully interpreted 10 ultrasound guidelines verbally.
Enhanced reasoning and tool selection with RL fine-tuning.
Demonstrated real-world autonomous scanning of multiple anatomical targets.
Abstract
Robotic ultrasound offers advantages over free-hand scanning, including improved reproducibility and reduced operator dependency. In clinical practice, US acquisition relies heavily on the sonographer's experience and situational judgment. When transferring this process to robotic systems, such expertise is often encoded explicitly through fixed procedures and task-specific models, yielding pipelines that can be difficult to adapt to new scanning tasks. In this work, we propose a unified framework for autonomous robotic US scanning that leverages a LLM-based agent to interpret US scanning guidelines and execute scans by dynamically invoking a set of provided software tools. Instead of encoding fixed scanning procedures, the LLM agent retrieves and reasons over guideline steps from scanning handbooks and adapts its planning decisions based on observations and the current scanning state.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoft Robotics and Applications · Multimodal Machine Learning Applications · Fetal and Pediatric Neurological Disorders
