SkinFormer: Learning Statistical Texture Representation with Transformer   for Skin Lesion Segmentation

Rongtao Xu; Changwei Wang; Jiguang Zhang; Shibiao Xu; Weiliang Meng,; Xiaopeng Zhang

arXiv:2409.08652·eess.IV·September 16, 2024

SkinFormer: Learning Statistical Texture Representation with Transformer for Skin Lesion Segmentation

Rongtao Xu, Changwei Wang, Jiguang Zhang, Shibiao Xu, Weiliang Meng,, Xiaopeng Zhang

PDF

Open Access 1 Repo

TL;DR

SkinFormer is a novel transformer-based network that effectively integrates local structural and global statistical texture features for improved skin lesion segmentation, achieving state-of-the-art results on multiple datasets.

Contribution

The paper introduces a Kurtosis-guided Statistical Counting Operator and specialized transformers to enhance texture representation in skin lesion segmentation.

Findings

01

Achieves 93.2% Dice score on ISIC 2018 dataset.

02

Outperforms existing state-of-the-art methods in skin lesion segmentation.

03

Demonstrates effective fusion of structural and statistical texture features.

Abstract

Accurate skin lesion segmentation from dermoscopic images is of great importance for skin cancer diagnosis. However, automatic segmentation of melanoma remains a challenging task because it is difficult to incorporate useful texture representations into the learning process. Texture representations are not only related to the local structural information learned by CNN, but also include the global statistical texture information of the input image. In this paper, we propose a trans\textbf{Former} network (\textbf{SkinFormer}) that efficiently extracts and fuses statistical texture representation for \textbf{Skin} lesion segmentation. Specifically, to quantify the statistical texture of input features, a Kurtosis-guided Statistical Counting Operator is designed. We propose Statistical Texture Fusion Transformer and Statistical Texture Enhance Transformer with the help of Kurtosis-guided…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rongtao-xu/skinformer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis

MethodsAttention Is All You Need · Byte Pair Encoding · Absolute Position Encodings · Softmax · Label Smoothing · Layer Normalization · Dropout · Position-Wise Feed-Forward Layer · Residual Connection · Linear Layer