Evaluating Pre-Training Bias on Severe Acute Respiratory Syndrome   Dataset

Diego Dimer Rodrigues

arXiv:2408.15398·cs.LG·August 29, 2024

Evaluating Pre-Training Bias on Severe Acute Respiratory Syndrome Dataset

Diego Dimer Rodrigues

PDF

Open Access

TL;DR

This paper investigates how pre-training bias metrics vary across regions in Brazil using the SARS dataset, aiming to identify potential biases before model training and deployment.

Contribution

It introduces a visualization approach for pre-training bias metrics across regions, highlighting regional differences in bias and model performance.

Findings

01

Bias metrics vary significantly across regions.

02

Model performance correlates with bias metric values.

03

Visualization helps identify potential harm before training.

Abstract

Machine learning (ML) is a growing field of computer science that has found many practical applications in several domains, including Health. However, as data grows in size and availability, and the number of models that aim to aid or replace human decisions, it raises the concern that these models can be susceptible to bias, which can lead to harm to specific individuals by basing its decisions on protected attributes such as gender, religion, sexual orientation, ethnicity, and others. Visualization techniques might generate insights and help summarize large datasets, enabling data scientists to understand the data better before training a model by evaluating pre-training metrics applied to the datasets before training, which might contribute to identifying potential harm before any effort is put into training and deploying the models. This work uses the severe acute respiratory…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNon-Invasive Vital Sign Monitoring · Artificial Intelligence in Healthcare · Heart Rate Variability and Autonomic Control