Advancing Medical Image Segmentation: Morphology-Driven Learning with Diffusion Transformer
Sungmin Kang, Jaeha Song, Jihie Kim

TL;DR
This paper introduces a Diffusion Transformer Segmentation model that leverages morphological analysis and self-supervised learning to improve medical image segmentation accuracy across multiple imaging modalities.
Contribution
It presents a novel transformer-based segmentation approach with morphology-driven learning techniques, outperforming previous models in noisy and complex medical images.
Findings
Better segmentation accuracy across CT, MRI, and lesion images
Robustness to noise and complex structures
Improved performance over existing models
Abstract
Understanding the morphological structure of medical images and precisely segmenting the region of interest or abnormality is an important task that can assist in diagnosis. However, the unique properties of medical imaging make clear segmentation difficult,and the high cost and time-consuming task of labeling leads to a coarse-grained representation of ground truth. Facing with these problems, we propose a novel Diffusion Transformer Segmentation (DTS) model for robust segmentation in the presence of noise. We propose an alternative to the dominant Denoising U-Net encoder through experiments applying a transformer architecture, which captures global dependency through self-attention. Additionally, we propose k-neighbor label smoothing, reverse boundary attention, and self-supervised learning with morphology-driven learning to improve the ability to identify complex structures. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Medical Image Segmentation Techniques · Radiomics and Machine Learning in Medical Imaging
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Linear Layer · Concatenated Skip Connection · Residual Connection · Multi-Head Attention · Attention Is All You Need · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Convolution
