AI-Powered Early Detection of Critical Diseases using Image Processing and Audio Analysis

Manisha More; Kavya Bhand; Kaustubh Mukdam; Kavya Sharma; Manas Kawtikwar; Hridayansh Kaware; Prajwal Kavhar

arXiv:2510.25199·cs.CV·October 30, 2025

AI-Powered Early Detection of Critical Diseases using Image Processing and Audio Analysis

Manisha More, Kavya Bhand, Kaustubh Mukdam, Kavya Sharma, Manas Kawtikwar, Hridayansh Kaware, Prajwal Kavhar

PDF

TL;DR

This paper introduces a multimodal AI framework combining image, thermal, and audio analysis for early detection of critical diseases, aiming to improve accessibility and reduce costs in healthcare diagnostics.

Contribution

It presents a novel integrated AI diagnostic system that combines multiple modalities and achieves high accuracy on diverse health conditions, suitable for low-resource settings.

Findings

01

Achieved 89.3% accuracy in skin cancer detection with MobileNetV2.

02

Supported thermal clot detection with 86.4% accuracy using SVM.

03

Classified cardiopulmonary sounds with 87.2% accuracy using Random Forest.

Abstract

Early diagnosis of critical diseases can significantly improve patient survival and reduce treatment costs. However, existing diagnostic techniques are often costly, invasive, and inaccessible in low-resource regions. This paper presents a multimodal artificial intelligence (AI) diagnostic framework integrating image analysis, thermal imaging, and audio signal processing for early detection of three major health conditions: skin cancer, vascular blood clots, and cardiopulmonary abnormalities. A fine-tuned MobileNetV2 convolutional neural network was trained on the ISIC 2019 dataset for skin lesion classification, achieving 89.3% accuracy, 91.6% sensitivity, and 88.2% specificity. A support vector machine (SVM) with handcrafted features was employed for thermal clot detection, achieving 86.4% accuracy (AUC = 0.89) on synthetic and clinical data. For cardiopulmonary analysis, lung and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.