SegDT: A Diffusion Transformer-Based Segmentation Model for Medical Imaging

Salah Eddine Bekhouche; Gaby Maroun; Fadi Dornaika; Abdenour Hadid

arXiv:2507.15595·cs.CV·July 22, 2025

SegDT: A Diffusion Transformer-Based Segmentation Model for Medical Imaging

Salah Eddine Bekhouche, Gaby Maroun, Fadi Dornaika, Abdenour Hadid

PDF

TL;DR

SegDT is a diffusion transformer-based model for medical image segmentation that achieves state-of-the-art results with fast inference, suitable for real-world healthcare applications.

Contribution

Introduces SegDT, a novel diffusion transformer model that enhances medical image segmentation performance while maintaining low computational costs.

Findings

01

Achieves state-of-the-art segmentation accuracy on benchmark datasets.

02

Maintains fast inference speeds suitable for clinical use.

03

Demonstrates robustness across multiple medical imaging datasets.

Abstract

Medical image segmentation is crucial for many healthcare tasks, including disease diagnosis and treatment planning. One key area is the segmentation of skin lesions, which is vital for diagnosing skin cancer and monitoring patients. In this context, this paper introduces SegDT, a new segmentation model based on diffusion transformer (DiT). SegDT is designed to work on low-cost hardware and incorporates Rectified Flow, which improves the generation quality at reduced inference steps and maintains the flexibility of standard diffusion models. Our method is evaluated on three benchmarking datasets and compared against several existing works, achieving state-of-the-art results while maintaining fast inference speeds. This makes the proposed model appealing for real-world medical applications. This work advances the performance and capabilities of deep learning models in medical image…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.