Consumer finance data generator - a new approach to Credit Scoring technique comparison
Karol Przanowski, Jolanta Mamczarz

TL;DR
This paper introduces a novel approach using data generators to compare credit scoring techniques, aiming to identify the most stable and effective method across different datasets.
Contribution
It proposes a data generator-based methodology for unbiased comparison of credit scoring techniques, addressing data availability limitations and model stability over time.
Findings
Comparison results of different scoring methods
Methodology for creating unbiased comparison datasets
Arguments favoring Logit and WOE approaches
Abstract
This paper aims to present a general idea of method comparison of Credit Scoring techniques. Any scorecard can be made in various methods based on variable transformations in the logistic regression model. To make a comparison and come up with the proof that one technique is better than another is a big challenge due to the limited availability of data. The same conclusion cannot be guaranteed when using other data from another source. The following research challenge can therefore be formulated: how should the comparison be managed in order to get general results that are not biased by particular data? The solution may be in the use of various random data generators. The data generator uses two approaches: transition matrix and scorings. Here are presented both: results of comparison methods and the methodology of these comparison techniques creating. Before building a new model the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare · Big Data and Business Intelligence · Data Mining Algorithms and Applications
