Improved Techniques for the Conditional Generative Augmentation of Clinical Audio Data
Mane Margaryan, Matthias Seibold, Indu Joshi, Mazda Farshad, Philipp, F\"urnstahl, Nassir Navab

TL;DR
This paper introduces a novel conditional GAN-based method with residual Squeeze and Excitation modules to synthesize clinical audio data, outperforming classical augmentation techniques and improving classifier performance.
Contribution
The paper presents a new GAN architecture with residual Squeeze and Excitation modules for clinical audio data augmentation, enhancing sample quality and classifier accuracy.
Findings
Outperforms classical audio augmentation techniques.
Achieves 2.84% improvement in Macro F1-Score.
Reduces redundancy in latent features.
Abstract
Data augmentation is a valuable tool for the design of deep learning systems to overcome data limitations and stabilize the training process. Especially in the medical domain, where the collection of large-scale data sets is challenging and expensive due to limited access to patient data, relevant environments, as well as strict regulations, community-curated large-scale public datasets, pretrained models, and advanced data augmentation methods are the main factors for developing reliable systems to improve patient care. However, for the development of medical acoustic sensing systems, an emerging field of research, the community lacks large-scale publicly available data sets and pretrained models. To address the problem of limited data, we propose a conditional generative adversarial neural network-based augmentation method which is able to synthesize mel spectrograms from a learned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Music and Audio Processing · Phonocardiography and Auscultation Techniques
