An Integrated Deep Learning Framework Leveraging NASNet and Vision   Transformer with MixProcessing for Accurate and Precise Diagnosis of Lung   Diseases

Sajjad Saleem; Muhammad Imran Sharif

arXiv:2502.20570·eess.IV·March 3, 2025

An Integrated Deep Learning Framework Leveraging NASNet and Vision Transformer with MixProcessing for Accurate and Precise Diagnosis of Lung Diseases

Sajjad Saleem, Muhammad Imran Sharif

PDF

TL;DR

This paper introduces NASNet-ViT, a deep learning framework combining NASNet and Vision Transformer with MixProcessing, achieving high accuracy in diagnosing multiple lung diseases efficiently for clinical use.

Contribution

It presents a novel integrated deep learning model that combines NASNet and Vision Transformer with a new preprocessing strategy for improved lung disease diagnosis.

Findings

01

Achieved 98.9% accuracy in classifying lung diseases.

02

Outperformed existing models like ResNet50 and MobileNet.

03

Model is compact and suitable for real-time clinical environments.

Abstract

The lungs are the essential organs of respiration, and this system is significant in the carbon dioxide and exchange between oxygen that occurs in human life. However, several lung diseases, which include pneumonia, tuberculosis, COVID-19, and lung cancer, are serious healthiness challenges and demand early and precise diagnostics. The methodological study has proposed a new deep learning framework called NASNet-ViT, which effectively incorporates the convolution capability of NASNet with the global attention mechanism capability of Vision Transformer ViT. The proposed model will classify the lung conditions into five classes: Lung cancer, COVID-19, pneumonia, TB, and normal. A sophisticated multi-faceted preprocessing strategy called MixProcessing has been used to improve diagnostic accuracy. This preprocessing combines wavelet transform, adaptive histogram equalization, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Byte Pair Encoding · Dense Connections · Residual Connection · Linear Layer · Absolute Position Encodings · Layer Normalization · Label Smoothing · Multi-Head Attention · Position-Wise Feed-Forward Layer