Med-Query: Steerable Parsing of 9-DoF Medical Anatomies with Query Embedding
Heng Guo, Jianfeng Zhang, Ke Yan, Le Lu, Minfeng Xu

TL;DR
Med-Query introduces a steerable, robust framework for 3D medical anatomy parsing that estimates 9-DoF poses and efficiently retrieves anatomies, improving accuracy and speed in CT scan analysis.
Contribution
This work presents a novel single-stage 9-DoF pose estimation method for anatomy parsing, enabling steerable retrieval and improved robustness in challenging CT scans.
Findings
Achieved 97.0% identification rate for ribs.
Attained 90.9% Dice score for rib segmentation.
Performed competitively on CTSpine1K and FLARE22 datasets.
Abstract
Automatic parsing of human anatomies at the instance-level from 3D computed tomography (CT) is a prerequisite step for many clinical applications. The presence of pathologies, broken structures or limited field-of-view (FOV) can all make anatomy parsing algorithms vulnerable. In this work, we explore how to leverage and implement the successful detection-then-segmentation paradigm for 3D medical data, and propose a steerable, robust, and efficient computing framework for detection, identification, and segmentation of anatomies in CT scans. Considering the complicated shapes, sizes, and orientations of anatomies, without loss of generality, we present a nine degrees of freedom (9-DoF) pose estimation solution in full 3D space using a novel single-stage, non-hierarchical representation. Our whole framework is executed in a steerable manner where any anatomy of interest can be directly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis · Dental Radiography and Imaging · Anatomy and Medical Technology
MethodsConvolution · Batch Normalization · Deep Layer Aggregation · Cascade Corner Pooling · Center Pooling · 1x1 Convolution · Non Maximum Suppression · Feature Pyramid Network · CenterNet · FCOS
