Managing dataset shift by adversarial validation for credit scoring
Hongyi Qian, Baohui Wang, Ping Ma, Lei Peng, Songfeng Gao, You Song

TL;DR
This paper introduces an adversarial validation-based approach to address dataset shift in credit scoring, improving model generalization by selecting training samples with similar distribution to the test data.
Contribution
The paper proposes a novel adversarial validation method combined with a splicing technique to mitigate dataset shift in credit scoring models.
Findings
The proposed method outperforms other data split techniques in experiments.
Addressing dataset shift improves credit scoring model robustness.
The method effectively utilizes all available data for training.
Abstract
Dataset shift is common in credit scoring scenarios, and the inconsistency between the distribution of training data and the data that actually needs to be predicted is likely to cause poor model performance. However, most of the current studies do not take this into account, and they directly mix data from different time periods when training the models. This brings about two problems. Firstly, there is a risk of data leakage, i.e., using future data to predict the past. This can result in inflated results in offline validation, but unsatisfactory results in practical applications. Secondly, the macroeconomic environment and risk control strategies are likely to be different in different time periods, and the behavior patterns of borrowers may also change. The model trained with past data may not be applicable to the recent stage. Therefore, we propose a method based on adversarial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Adversarial Robustness in Machine Learning · Reservoir Engineering and Simulation Methods
