TransNuSeg: A Lightweight Multi-Task Transformer for Nuclei Segmentation
Zhenqi He, Mathias Unberath, Jing Ke, Yiqing Shen

TL;DR
TransNuSeg introduces a lightweight, multi-task Transformer framework for nuclei segmentation that outperforms existing CNN-based methods by leveraging shared attention and self-distillation, reducing complexity while maintaining high accuracy.
Contribution
It is the first pure Transformer approach for nuclei segmentation, employing multi-task learning, attention sharing, and a token MLP bottleneck to improve efficiency and accuracy.
Findings
Outperforms state-of-the-art methods by 2-3% Dice score.
Uses 30% fewer parameters than comparable models.
Effective in different modality datasets.
Abstract
Nuclei appear small in size, yet, in real clinical practice, the global spatial information and correlation of the color or brightness contrast between nuclei and background, have been considered a crucial component for accurate nuclei segmentation. However, the field of automatic nuclei segmentation is dominated by Convolutional Neural Networks (CNNs), meanwhile, the potential of the recently prevalent Transformers has not been fully explored, which is powerful in capturing local-global correlations. To this end, we make the first attempt at a pure Transformer framework for nuclei segmentation, called TransNuSeg. Different from prior work, we decouple the challenging nuclei segmentation task into an intrinsic multi-task learning task, where a tri-decoder structure is employed for nuclei instance, nuclei edge, and clustered edge segmentation respectively. To eliminate the divergent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · AI in cancer detection · Cervical Cancer and HPV Research
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Absolute Position Encodings · Adam · Layer Normalization
