Contrastive Learning Improves Critical Event Prediction in COVID-19 Patients
Tingyi Wanyan, Hossein Honarvar, Suraj K. Jaladanki, Chengxi Zang,, Nidhi Naik, Sulaiman Somani, Jessica K. De Freitas, Ishan Paranjpe, Akhil, Vaid, Riccardo Miotto, Girish N. Nadkarni, Marinka Zitnik, ArifulAzad, Fei, Wang, Ying Ding, Benjamin S. Glicksberg

TL;DR
This study demonstrates that contrastive loss enhances machine learning models' ability to predict critical COVID-19 outcomes from imbalanced electronic health record data, outperforming traditional methods in accuracy and feature identification.
Contribution
First application of contrastive loss to improve COVID-19 outcome prediction models using imbalanced EHR data, showing consistent performance gains over cross-entropy loss.
Findings
Contrastive loss improves model performance on imbalanced data.
CL maintains proper feature clustering and importance.
Models with CL outperform CEL in AUROC and AUPRC metrics.
Abstract
Machine Learning (ML) models typically require large-scale, balanced training data to be robust, generalizable, and effective in the context of healthcare. This has been a major issue for developing ML models for the coronavirus-disease 2019 (COVID-19) pandemic where data is highly imbalanced, particularly within electronic health records (EHR) research. Conventional approaches in ML use cross-entropy loss (CEL) that often suffers from poor margin classification. For the first time, we show that contrastive loss (CL) improves the performance of CEL especially for imbalanced EHR data and the related COVID-19 analyses. This study has been approved by the Institutional Review Board at the Icahn School of Medicine at Mount Sinai. We use EHR data from five hospitals within the Mount Sinai Health System (MSHS) to predict mortality, intubation, and intensive care unit (ICU) transfer in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
