# SwinDAF3D: Pyramid Swin Transformers with Deep Attentive Features for Automated Finger Joint Segmentation in 3D Ultrasound Images for Rheumatoid Arthritis Assessment

**Authors:** Jianwei Qiu, Grigorios M. Karageorgos, Xiaorui Peng, Soumya Ghose, Zhaoyuan Yang, Aaron Dentinger, Zhanpeng Xu, Janggun Jo, Siddarth Ragupathi, Guan Xu, Nada Abdulaziz, Girish Gandikota, Xueding Wang, David Mills

PMC · DOI: 10.3390/bioengineering12040390 · Bioengineering · 2025-04-05

## TL;DR

This paper introduces SwinDAF3D, a new AI model that improves the accuracy of segmenting finger joints in 3D ultrasound images for rheumatoid arthritis assessment.

## Contribution

The novel integration of Swin Transformers with a deep attentive features framework for 3D ultrasound segmentation.

## Key findings

- SwinDAF3D achieved a Dice Score of 0.838 ± 0.013, outperforming existing models in synovium segmentation.
- The model showed improved performance in capturing multi-scale and attentive contextual information in 3D ultrasound.
- Results suggest potential for more efficient and standardized rheumatoid arthritis screening using ultrasound.

## Abstract

Rheumatoid arthritis (RA) is a chronic autoimmune disease that can cause severe joint damage and functional impairment. Ultrasound imaging has shown promise in providing real-time assessment of synovium inflammation associated with the early stages of RA. Accurate segmentation of the synovium region and quantification of inflammation-specific imaging biomarkers are crucial for assessing and grading RA. However, automatic segmentation of the synovium in 3D ultrasound is challenging due to ambiguous boundaries, variability in synovium shape, and inhomogeneous intensity distribution. In this work, we introduce a novel network architecture, Swin Transformers with Deep Attentive Features for 3D segmentation (SwinDAF3D), which integrates Swin Transformers into a Deep Attentive Features framework. The developed architecture leverages the hierarchical structure and shifted windows of Swin Transformers to capture rich, multi-scale and attentive contextual information, improving the modeling of long-range dependencies and spatial hierarchies in 3D ultrasound images. In a six-fold cross-validation study with 3D ultrasound images of RA patients’ finger joints (n = 72), our SwinDAF3D model achieved the highest performance with a Dice Score (DSC) of 0.838 ± 0.013, an Intersection over Union (IoU) of 0.719 ± 0.019, and Surface Dice Score (SDSC) of 0.852 ± 0.020, compared to 3D UNet (DSC: 0.742 ± 0.025; IoU: 0.589 ± 0.031; SDSC: 0.661 ± 0.029), DAF3D (DSC: 0.813 ± 0.017; IoU: 0.689 ± 0.022; SDSC: 0.817 ± 0.013), Swin UNETR (DSC: 0.808 ± 0.025; IoU: 0.678 ± 0.032; SDSC: 0.822 ± 0.039), UNETR++ (DSC: 0.810 ± 0.014; IoU: 0.684 ± 0.018; SDSC: 0.829 ± 0.027) and TransUNet (DSC: 0.818 ± 0.013; IoU: 0.692 ± 0.017; SDSC: 0.815 ± 0.016) models. This ablation study demonstrates the effectiveness of combining a Swin Transformers feature pyramid with a deep attention mechanism, improving the segmentation accuracy of the synovium in 3D ultrasound. This advancement shows great promise in enabling more efficient and standardized RA screening using ultrasound imaging.

## Linked entities

- **Diseases:** Rheumatoid arthritis (MONDO:0008383)

## Full-text entities

- **Diseases:** autoimmune disease (MESH:D001327), inflammation (MESH:D007249), RA (MESH:D001172), joint damage (MESH:D007592), functional impairment (MESH:D003072)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12025309/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12025309/full.md

## References

57 references — full list in the complete paper: https://tomesphere.com/paper/PMC12025309/full.md

---
Source: https://tomesphere.com/paper/PMC12025309