Non-Heuristic Selection via Hybrid Regularized and Machine Learning Models for Insurance
Luciano Ribeiro Galv\~ao, Rafael de Andrade Moral

TL;DR
This paper explores a hybrid modeling approach combining regularized feature selection with machine learning classifiers to predict insurance purchase behavior, achieving high accuracy while maintaining interpretability.
Contribution
It introduces a hybrid modeling framework that integrates regularized models with advanced machine learning classifiers for improved insurance customer prediction.
Findings
Hybrid CatBoost with Lasso achieved AUC of 0.861
Hybrid models outperformed individual classifiers in predictive metrics
Regularized feature selection maintained interpretability while enhancing accuracy
Abstract
In this study, machine learning models were tested to predict whether or not a customer of an insurance company would purchase a travel insurance product. For this purpose, secondary data provided by an open-source website that compiles databases from statistical modeling competitions were used. The dataset used presents approximately 2,700 records from an unidentified company in the tourism insurance sector. Initially, the feature engineering stage was carried out, which were selected through regularized models: Ridge, Lasso and Elastic-Net. In this phase, gains were observed not only in relation to dimensionality, but also in the maintenance of interpretative capacity, through the coefficients obtained. After this process, five classification models were evaluated (Random Forests, XGBoost, H2O GBM, LightGBM and CatBoost) separately and in a hybrid way with the previous regularized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Financial Distress and Bankruptcy Prediction · Customer churn and segmentation
