Explainable Multi-Modal Deep Learning for Automatic Detection of Lung Diseases from Respiratory Audio Signals
S M Asiful Islam Saky, Md Rashidul Islam, Md Saiful Arefin, Shahaba Alam

TL;DR
This paper introduces an explainable multimodal deep learning system for automatic lung disease detection from respiratory sounds, combining spectral-temporal and handcrafted features, achieving high accuracy and interpretability.
Contribution
It presents a novel multimodal deep learning framework that integrates data-driven and domain-informed features with explainability tools for respiratory disease detection.
Findings
Achieved 91.21% accuracy in lung disease detection
Demonstrated the importance of temporal modeling and multimodal fusion
Provided interpretable explanations aligned with known acoustic biomarkers
Abstract
Respiratory diseases remain major global health challenges, and traditional auscultation is often limited by subjectivity, environmental noise, and inter-clinician variability. This study presents an explainable multimodal deep learning framework for automatic lung-disease detection using respiratory audio signals. The proposed system integrates two complementary representations: a spectral-temporal encoder based on a CNN-BiLSTM Attention architecture, and a handcrafted acoustic-feature encoder capturing physiologically meaningful descriptors such as MFCCs, spectral centroid, spectral bandwidth, and zero-crossing rate. These branches are combined through late-stage fusion to leverage both data-driven learning and domain-informed acoustic cues. The model is trained and evaluated on the Asthma Detection Dataset Version 2 using rigorous preprocessing, including resampling, normalization,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhonocardiography and Auscultation Techniques · Respiratory and Cough-Related Research · Voice and Speech Disorders
