Efficient and Accurate Tuberculosis Diagnosis: Attention Residual U-Net and Vision Transformer Based Detection Framework
Greeshma K, Vishnukumar S

TL;DR
This paper introduces a two-stage deep learning framework combining an attention residual U-Net for segmentation and a Vision Transformer for classification to improve tuberculosis bacilli detection in microscopic images, enhancing accuracy and automation.
Contribution
It presents a novel two-stage deep learning approach with an attention residual U-Net and a customized Vision Transformer for TB detection, outperforming existing methods.
Findings
Improved segmentation accuracy over existing models
Higher classification precision with TBViT
Enhanced automation in TB diagnosis workflow
Abstract
Tuberculosis (TB), an infectious disease caused by Mycobacterium tuberculosis, continues to be a major global health threat despite being preventable and curable. This burden is particularly high in low and middle income countries. Microscopy remains essential for diagnosing TB by enabling direct visualization of Mycobacterium tuberculosis in sputum smear samples, offering a cost effective approach for early detection and effective treatment. Given the labour-intensive nature of microscopy, automating the detection of bacilli in microscopic images is crucial to improve both the expediency and reliability of TB diagnosis. The current methodologies for detecting tuberculosis bacilli in bright field microscopic sputum smear images are hindered by limited automation capabilities, inconsistent segmentation quality, and constrained classification precision. This paper proposes a twostage deep…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Digital Imaging for Blood Diseases · COVID-19 diagnosis using AI
MethodsAttention Is All You Need · Max Pooling · Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Absolute Position Encodings · Softmax · Linear Layer · Adam · Concatenated Skip Connection · Residual Connection
