Refining 3D Medical Segmentation with Verbal Instruction
Kangxian Xie, Jiancheng Yang, Nandor Pinter, Chao Wu, Behzad Bozorgtabar, Mingchen Gao

TL;DR
This paper introduces CoWTalk, a benchmark for 3D arterial shape correction with verbal instructions, and proposes an iterative refinement model that improves shape predictions using natural language feedback.
Contribution
The paper presents a new benchmark dataset and a novel language-interactive refinement method for 3D medical shape segmentation.
Findings
Significant improvement over corrupted inputs
Competitive performance against baseline methods
Feasibility of language-driven shape refinement
Abstract
Accurate 3D anatomical segmentation is essential for clinical diagnosis and surgical planning. However, automated models frequently generate suboptimal shape predictions due to factors such as limited and imbalanced training data, inadequate labeling quality, and distribution shifts between training and deployment settings. A natural solution is to iteratively refine the predicted shape based on the radiologists' verbal instructions. However, this is hindered by the scarcity of paired data that explicitly links erroneous shapes to corresponding corrective instructions. As an initial step toward addressing this limitation, we introduce CoWTalk, a benchmark comprising 3D arterial anatomies with controllable synthesized anatomical errors and their corresponding repairing instructions. Building on this benchmark, we further propose an iterative refinement model that represents 3D shapes as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis · Medical Image Segmentation Techniques · Anatomy and Medical Technology
