Transfer Learning for Microstructure Segmentation with CS-UNet: A Hybrid   Algorithm with Transformer and CNN Encoders

Khaled Alrfou; Tian Zhao; Amir Kordijazi

arXiv:2308.13917·cs.CV·August 29, 2023·5 cites

Transfer Learning for Microstructure Segmentation with CS-UNet: A Hybrid Algorithm with Transformer and CNN Encoders

Khaled Alrfou, Tian Zhao, Amir Kordijazi

PDF

Open Access 1 Repo

TL;DR

This paper compares the effectiveness of Transformer and CNN encoders pre-trained on microscopy images for microstructure segmentation, demonstrating that their combination enhances performance, especially with limited training data and out-of-distribution images.

Contribution

It introduces a hybrid CS-UNet model combining Transformer and CNN encoders pre-trained on microscopy images, showing improved segmentation performance over CNN alone.

Findings

01

Pre-training on microscopy images improves out-of-distribution segmentation.

02

Hybrid Transformer-CNN models outperform CNN-only models.

03

Transformers and CNNs complement each other for better segmentation.

Abstract

Transfer learning improves the performance of deep learning models by initializing them with parameters pre-trained on larger datasets. Intuitively, transfer learning is more effective when pre-training is on the in-domain datasets. A recent study by NASA has demonstrated that the microstructure segmentation with encoder-decoder algorithms benefits more from CNN encoders pre-trained on microscopy images than from those pre-trained on natural images. However, CNN models only capture the local spatial relations in images. In recent years, attention networks such as Transformers are increasingly used in image analysis to capture the long-range relations between pixels. In this study, we compare the segmentation performance of Transformer and CNN models pre-trained on microscopy images with those pre-trained on natural images. Our result partially confirms the NASA study that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kalrfou/swint-pretrained-microscopy-models
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMineral Processing and Grinding · Advanced X-ray and CT Imaging · Machine Learning in Materials Science

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Dense Connections · Layer Normalization · Dropout · Byte Pair Encoding · Adam · Position-Wise Feed-Forward Layer