A proposed simulation technique for population stability testing in credit risk scorecards
J. du Pisanie, J.S. Allison, I.J.H. Visagie

TL;DR
This paper introduces a simulation method for generating realistic credit risk scorecard data based on specifying bad ratios, aiding in population stability testing and model evaluation.
Contribution
It presents a novel simulation technique that uses bad ratios to generate multivariate data for credit scoring models, addressing the complexity of parameter specification.
Findings
Simulated data closely matches specified bad ratios.
Method effectively generates realistic multivariate credit data.
Provides practical R code for implementation.
Abstract
Credit risk scorecards are logistic regression models, fitted to large and complex data sets, employed by the financial industry to model the probability of default of a potential customer. In order to ensure that a scorecard remains a representative model of the population one tests the hypothesis of population stability; specifying that the distribution of clients' attributes remains constant over time. Simulating realistic data sets for this purpose is nontrivial as these data sets are multivariate and contain intricate dependencies. The simulation of these data sets are of practical interest for both practitioners and for researchers; practitioners may wish to consider the effect that a specified change in the properties of the data has on the scorecard and its usefulness from a business perspective, while researchers may wish to test a newly developed technique in credit scoring.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbability and Risk Models · Financial Distress and Bankruptcy Prediction
