A machine learning workflow to address credit default prediction
Rambod Rahmani, Marco Parola, and Mario G.C.A. Cimino

TL;DR
This paper presents a comprehensive machine learning workflow for credit default prediction that integrates data preprocessing, ensemble modeling, and multi-objective hyperparameter optimization to enhance credit risk assessment accuracy and reliability.
Contribution
It introduces a systematic workflow combining data encoding, ensemble learning, and genetic algorithm-based hyperparameter tuning specifically for credit default prediction.
Findings
Improved accuracy in credit default prediction models.
Effective handling of data preprocessing with Weight of Evidence encoding.
Enhanced model robustness through ensemble techniques and multi-objective optimization.
Abstract
Due to the recent increase in interest in Financial Technology (FinTech), applications like credit default prediction (CDP) are gaining significant industrial and academic attention. In this regard, CDP plays a crucial role in assessing the creditworthiness of individuals and businesses, enabling lenders to make informed decisions regarding loan approvals and risk management. In this paper, we propose a workflow-based approach to improve CDP, which refers to the task of assessing the probability that a borrower will default on his or her credit obligations. The workflow consists of multiple steps, each designed to leverage the strengths of different techniques featured in machine learning pipelines and, thus best solve the CDP task. We employ a comprehensive and systematic approach starting with data preprocessing using Weight of Evidence encoding, a technique that ensures in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Distress and Bankruptcy Prediction · FinTech, Crowdfunding, Digital Finance · Imbalanced Data Classification Techniques
