RMT-Net: Reject-aware Multi-Task Network for Modeling Missing-not-at-random Data in Financial Credit Scoring
Qiang Liu, Yingtao Luo, Shu Wu, Zhen Zhang, Xiangnan Yue, Hong Jin,, Liang Wang

TL;DR
This paper introduces RMT-Net, a multi-task learning model that effectively addresses missing-not-at-random bias in financial credit scoring by leveraging the correlation between rejection and default classification tasks.
Contribution
The paper proposes the first multi-task learning approach, RMT-Net, to model biased credit scoring data by jointly learning rejection and default tasks with a gating mechanism.
Findings
RMT-Net outperforms baseline models on multiple datasets.
RMT-Net++ further enhances performance in complex rejection scenarios.
The approach effectively utilizes rejection information to improve default prediction accuracy.
Abstract
In financial credit scoring, loan applications may be approved or rejected. We can only observe default/non-default labels for approved samples but have no observations for rejected samples, which leads to missing-not-at-random selection bias. Machine learning models trained on such biased data are inevitably unreliable. In this work, we find that the default/non-default classification task and the rejection/approval classification task are highly correlated, according to both real-world data study and theoretical analysis. Consequently, the learning of default/non-default can benefit from rejection/approval. Accordingly, we for the first time propose to model the biased credit scoring data with Multi-Task Learning (MTL). Specifically, we propose a novel Reject-aware Multi-Task Network (RMT-Net), which learns the task weights that control the information sharing from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Distress and Bankruptcy Prediction · Imbalanced Data Classification Techniques
