CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor   Segmentation

Zhongzhen Huang; Yankai Jiang; Rongzhao Zhang; Shaoting Zhang; Xiaofan; Zhang

arXiv:2406.07085·cs.CV·November 1, 2024·2 cites

CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation

Zhongzhen Huang, Yankai Jiang, Rongzhao Zhang, Shaoting Zhang, Xiaofan, Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

The paper introduces CAT, a novel model that combines visual and textual prompts to improve multi-organ and tumor segmentation in medical images, addressing the limitations of existing prompt-based methods.

Contribution

It proposes a dual-prompt schema and a unified framework with a ShareRefiner to enhance segmentation accuracy across diverse medical imaging scenarios.

Findings

01

Superior performance on 10 public CT datasets

02

Effective segmentation of tumors across multiple cancer stages

03

Demonstrates the benefits of multimodal prompt coordination

Abstract

Existing promptable segmentation methods in the medical imaging field primarily consider either textual or visual prompts to segment relevant objects, yet they often fall short when addressing anomalies in medical images, like tumors, which may vary greatly in shape, size, and appearance. Recognizing the complexity of medical scenarios and the limitations of textual or visual prompts, we propose a novel dual-prompt schema that leverages the complementary strengths of visual and textual prompts for segmenting various organs and tumors. Specifically, we introduce CAT, an innovative model that Coordinates Anatomical prompts derived from 3D cropped images with Textual prompts enriched by medical domain knowledge. The model architecture adopts a general query-based design, where prompt queries facilitate segmentation queries for mask prediction. To synergize two types of prompts within a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zongzi3zz/cat
pytorchOfficial

Videos

CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation· slideslive

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Natural Language Processing Techniques