Comparative Evaluation of Machine Learning Approaches for Minority-Class Financial Distress Prediction Under Class Imbalance Constraints

Karan Sehgal; Khawar Naveed Bhatti

arXiv:2605.14067·cs.LG·May 15, 2026

Comparative Evaluation of Machine Learning Approaches for Minority-Class Financial Distress Prediction Under Class Imbalance Constraints

Karan Sehgal, Khawar Naveed Bhatti

PDF

TL;DR

This study compares various machine learning methods, including ensemble and neural models, for predicting financial distress in highly imbalanced datasets, emphasizing interpretability and reproducibility.

Contribution

It provides a comprehensive evaluation of classical, ensemble, and neural approaches with imbalance mitigation and explainability techniques for financial distress prediction.

Findings

01

Gradient-boosting methods improved minority-class sensitivity.

02

SMOTE effectively mitigated class imbalance.

03

Workflow emphasizes reproducibility and interpretability.

Abstract

Financial distress prediction remains a significant challenge in enterprise risk analysis due to the highly imbalanced nature of real-world financial datasets, where bankrupt or distressed firms typically constitute only a small minority of observations. This paper presents a comparative evaluation of classical statistical methods, ensemble learning approaches, and exploratory neural models for minority-class financial distress prediction under class imbalance constraints. The study incorporates structured preprocessing, imbalance mitigation using the Synthetic Minority Oversampling Technique (SMOTE), comparative evaluation across ensemble learning architectures including XGBoost, CatBoost, LightGBM, Random Forest, and explainability analysis using SHAP-based feature attribution methods. Experimental evaluation demonstrates that gradient-boosting approaches achieved improved…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.