Alternating Loss Correction for Preterm-Birth Prediction from EHR Data with Noisy Labels
Sabri Boughorbel, Fethi Jarray, Neethu Venugopal, Haithum Elhadi

TL;DR
This paper introduces an alternating loss correction method for training deep neural networks on EHR data with noisy labels to improve preterm birth prediction accuracy.
Contribution
The paper proposes a novel alternating loss correction approach that leverages both clean and noisy labels in EHR data for better preterm birth prediction.
Findings
Improved prediction accuracy over baseline methods.
Effective use of noisy labels through the proposed correction method.
Enhanced model robustness with mixed label quality data.
Abstract
In this paper we are interested in the prediction of preterm birth based on diagnosis codes from longitudinal EHR. We formulate the prediction problem as a supervised classification with noisy labels. Our base classifier is a Recurrent Neural Network with an attention mechanism. We assume the availability of a data subset with both noisy and clean labels. For the cohort definition, most of the diagnosis codes on mothers' records related to pregnancy are ambiguous for the definition of full-term and preterm classes. On the other hand, diagnosis codes on babies' records provide fine-grained information on prematurity. Due to data de-identification, the links between mothers and babies are not available. We developed a heuristic based on admission and discharge times to match babies to their mothers and hence enrich mothers' records with additional information on delivery status. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning in Healthcare · Domain Adaptation and Few-Shot Learning
