Investigation into respiratory sound classification for an imbalanced data set using hybrid LSTM-KAN architectures
Nithinkumar K.V, Anand R

TL;DR
This paper presents a hybrid deep learning approach combining LSTM and KAN architectures to improve respiratory sound classification accuracy in highly imbalanced datasets, utilizing advanced imbalance mitigation techniques.
Contribution
It introduces a novel hybrid LSTM-KAN model with specific strategies for class imbalance, achieving high accuracy and better minority class detection.
Findings
Overall accuracy of 94.6% achieved
Macro F1 score of 0.703 obtained
Enhanced minority class recognition compared to baselines
Abstract
Respiratory sounds captured via auscultation contain critical clues for diagnosing pulmonary conditions. Automated classification of these sounds faces challenges due to subtle acoustic differences and severe class imbalance in clinical datasets. This study investigates respiratory sound classification with a focus on mitigating pronounced class imbalance. We propose a hybrid deep learning model that combines a Long Short-Term Memory (LSTM) network for sequential feature encoding with a Kolmogorov-Arnold Network (KAN) for classification. The model is integrated with a comprehensive feature extraction pipeline and targeted imbalance mitigation strategies. Experiments were conducted on a public respiratory sound database comprising six classes with a highly skewed distribution. Techniques such as focal loss, class-specific data augmentation, and Synthetic Minority Over-sampling Technique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhonocardiography and Auscultation Techniques · Respiratory and Cough-Related Research · Voice and Speech Disorders
