DINO-CXR: A self supervised method based on vision transformer for chest X-ray classification
Mohammadreza Shakouri, Fatemeh Iranmanesh, Mahdi Eftekhari

TL;DR
DINO-CXR introduces a self-supervised vision transformer approach for chest X-ray classification, effectively reducing labeled data needs and outperforming existing methods in accuracy for pneumonia and COVID-19 detection.
Contribution
It adapts the DINO self-supervised method to medical imaging, specifically chest X-ray classification, demonstrating improved accuracy with less labeled data.
Findings
Outperforms state-of-the-art methods in accuracy
Achieves comparable AUC and F-1 scores with less labeled data
Effective for pneumonia and COVID-19 detection
Abstract
The limited availability of labeled chest X-ray datasets is a significant bottleneck in the development of medical imaging methods. Self-supervised learning (SSL) can mitigate this problem by training models on unlabeled data. Furthermore, self-supervised pretraining has yielded promising results in visual recognition of natural images but has not been given much consideration in medical image analysis. In this work, we propose a self-supervised method, DINO-CXR, which is a novel adaptation of a self-supervised method, DINO, based on a vision transformer for chest X-ray classification. A comparative analysis is performed to show the effectiveness of the proposed method for both pneumonia and COVID-19 detection. Through a quantitative analysis, it is also shown that the proposed method outperforms state-of-the-art methods in terms of accuracy and achieves comparable results in terms of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · AI in cancer detection · Radiomics and Machine Learning in Medical Imaging
MethodsMulti-Head Attention · Attention Is All You Need · Softmax · Linear Layer · Dense Connections · Residual Connection · Layer Normalization · Vision Transformer
