DNN2LR: Automatic Feature Crossing for Credit Scoring
Qiang Liu, Zhaocheng Liu, Haoli Zhang, Yuntian Chen, Jun Zhu

TL;DR
This paper introduces DNN2LR, an automatic feature crossing method that enhances logistic regression models with complex feature interactions derived from DNNs, improving accuracy and interpretability in credit scoring.
Contribution
The paper proposes a novel method to automatically identify feature interactions from DNNs and incorporate them into LR models, addressing efficiency issues in credit scoring applications.
Findings
DNN2LR outperforms traditional DNN and feature crossing methods.
It accelerates feature crossing by 10 to 40 times on large datasets.
The resulting LR model remains interpretable while achieving high accuracy.
Abstract
Credit scoring is a major application of machine learning for financial institutions to decide whether to approve or reject a credit loan. For sake of reliability, it is necessary for credit scoring models to be both accurate and globally interpretable. Simple classifiers, e.g., Logistic Regression (LR), are white-box models, but not powerful enough to model complex nonlinear interactions among features. Fortunately, automatic feature crossing is a promising way to find cross features to make simple classifiers to be more accurate without heavy handcrafted feature engineering. However, credit scoring is usually based on different aspects of users, and the data usually contains hundreds of feature fields. This makes existing automatic feature crossing methods not efficient for credit scoring. In this work, we find local piece-wise interpretations in Deep Neural Networks (DNNs) of a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Imbalanced Data Classification Techniques · Explainable Artificial Intelligence (XAI)
MethodsDNN2LR · Logistic Regression
