Multimodal Spatial Attention Module for Targeting Multimodal PET-CT Lung Tumor Segmentation
Xiaohang Fu, Lei Bi, Ashnil Kumar, Michael Fulham, Jinman Kim

TL;DR
This paper introduces a multimodal spatial attention module (MSAM) that enhances PET-CT lung tumor segmentation by focusing on tumor-related regions, significantly improving accuracy over existing methods.
Contribution
The novel MSAM effectively leverages PET-CT data for better tumor localization, surpassing state-of-the-art segmentation approaches with a simple, end-to-end trainable module.
Findings
MSAM improves segmentation accuracy by 7.6% DSC over previous methods.
The module is adaptable to common CNN architectures like U-Net.
Validated on two clinical datasets of NSCLC and STS.
Abstract
Multimodal positron emission tomography-computed tomography (PET-CT) is used routinely in the assessment of cancer. PET-CT combines the high sensitivity for tumor detection with PET and anatomical information from CT. Tumor segmentation is a critical element of PET-CT but at present, there is not an accurate automated segmentation method. Segmentation tends to be done manually by different imaging experts and it is labor-intensive and prone to errors and inconsistency. Previous automated segmentation methods largely focused on fusing information that is extracted separately from the PET and CT modalities, with the underlying assumption that each modality contains complementary information. However, these methods do not fully exploit the high PET tumor sensitivity that can guide the segmentation. We introduce a multimodal spatial attention module (MSAM) that automatically learns to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Sigmoid Activation · Convolution · Communication--Guide||How Do I Communicate to Expedia? · Concatenated Skip Connection · U-Net
