Speaker-Disentangled Remote Speech Detection of Asthma and COPD Exacerbations
Yuyang Yan, Sami O. Simons, Visara Urovi

TL;DR
This paper introduces an adversarial learning framework for remote speech-based detection of asthma and COPD exacerbations that disentangles disease-related features from speaker identity, improving accuracy and privacy.
Contribution
The novel adversarial architecture effectively separates pathology signals from speaker attributes, enhancing interpretability and generalizability in respiratory disease monitoring.
Findings
Improved AUC from 0.897 to 0.910 for respiratory status classification.
Enhanced AUC from 0.674 to 0.793 for exacerbation type classification.
Reduced speaker dependency confirmed by decreased J-ratio and external validation.
Abstract
Early detection of exacerbations in asthma and chronic obstructive pulmonary disease (COPD) is important for timely intervention. Speech has emerged as a promising tool for continuous, non-invasive respiratory disease monitoring. However, speech signals inherently carry speaker-identifiable attributes that may dominate model predictions, which may compromise both diagnosis performance and patient privacy. Furthermore, the acoustic features associated with respiratory disease and speaker identity remain unclear in respiratory disease monitoring. We propose an adversarial learning architecture that disentangles pathology-related acoustic patterns from speaker-identifiable attributes. The framework optimizes two clinically hierarchical tasks: (i) respiratory status classification (stable vs. exacerbated) and (ii) exacerbation type classification (asthma exacerbation vs. COPD exacerbation).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
