
TL;DR
This paper introduces a gradient boosting method for creating compact, interpretable risk scores that outperform traditional linear models in predictive accuracy across various datasets.
Contribution
It presents a novel gradient boosting algorithm for risk scores, enabling nonlinear modeling and producing more concise, interpretable models than existing linear approaches.
Findings
Achieves competitive predictive performance across diverse tasks.
Produces 60% fewer rules for classification and 16% fewer for time-to-event tasks.
Outperforms AutoScore in model compactness.
Abstract
Risk scores are an interpretable and actionable class of machine learning models with applications in medicine, insurance, and risk management. Unlike most computational methods, risk scores are designed to be computed by a human by attributing points to a data sample based on a limited set of criteria. The most common approaches for generating risk scores use linear regressions to estimate the effect of selected variables. We propose a simple and effective approach towards building compact and predictive risk scores. We provide an algorithm based on gradient boosting that is capable of modeling nonlinear effects, along with a C++ implementation with Python and R bindings. Through extensive empirical evaluation on twelve tabular datasets spanning regression, classification, and time-to-event tasks, we show that our method achieves competitive predictive performance while producing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
