Mitigating Sex Bias in Audio Data-driven COPD and COVID-19 Breathing Pattern Detection Models
Rachel Pfeifer, Sudip Vhaduri, James Eric Dietz

TL;DR
This paper investigates sex bias in machine learning models for respiratory disease detection from audio data, and demonstrates bias mitigation techniques that significantly improve fairness metrics.
Contribution
It introduces bias analysis and mitigation methods for breathing pattern models in COPD and COVID-19 detection, focusing on demographic fairness.
Findings
Bias mitigation improved demographic parity by 81.43%
Equalized odds difference was reduced by 71.81%
Bias mitigation techniques were statistically significant
Abstract
In the healthcare industry, researchers have been developing machine learning models to automate diagnosing patients with respiratory illnesses based on their breathing patterns. However, these models do not consider the demographic biases, particularly sex bias, that often occur when models are trained with a skewed patient dataset. Hence, it is essential in such an important industry to reduce this bias so that models can make fair diagnoses. In this work, we examine the bias in models used to detect breathing patterns of two major respiratory diseases, i.e., chronic obstructive pulmonary disease (COPD) and COVID-19. Using decision tree models trained with audio recordings of breathing patterns obtained from two open-source datasets consisting of 29 COPD and 680 COVID-19-positive patients, we analyze the effect of sex bias on the models. With a threshold optimizer and two constraints…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhonocardiography and Auscultation Techniques · Voice and Speech Disorders · Chronic Obstructive Pulmonary Disease (COPD) Research
