AutoScore-Imbalance: An interpretable machine learning tool for development of clinical scores with rare events data
Han Yuan, Feng Xie, Marcus Eng Hock Ong, Yilin Ning, Marcel Lucas, Chee, Seyed Ehsan Saffari, Hairil Rizal Abdullah, Benjamin Alan Goldstein,, Bibhas Chakraborty, Nan Liu

TL;DR
AutoScore-Imbalance is an interpretable machine learning tool designed to improve clinical score development for rare events by optimizing training data and sample weights, leading to better prediction of inpatient mortality.
Contribution
It introduces a novel framework that enhances AutoScore for unbalanced data, combining dataset and sample weight optimization to improve predictive performance.
Findings
AutoScore-Imbalance outperformed baseline models in AUC and accuracy.
A 9-variable sub-model achieved an AUC of 0.786, surpassing other models.
The method effectively balances performance and variable sparsity.
Abstract
Background: Medical decision-making impacts both individual and public health. Clinical scores are commonly used among a wide variety of decision-making models for determining the degree of disease deterioration at the bedside. AutoScore was proposed as a useful clinical score generator based on machine learning and a generalized linear model. Its current framework, however, still leaves room for improvement when addressing unbalanced data of rare events. Methods: Using machine intelligence approaches, we developed AutoScore-Imbalance, which comprises three components: training dataset optimization, sample weight optimization, and adjusted AutoScore. All scoring models were evaluated on the basis of their area under the curve (AUC) in the receiver operating characteristic analysis and balanced accuracy (i.e., mean value of sensitivity and specificity). By utilizing a publicly accessible…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Medical Coding and Health Information · Artificial Intelligence in Healthcare
MethodsLogistic Regression
