Debiasing Alternative Data for Credit Underwriting Using Causal   Inference

Chris Lam

arXiv:2410.22382·q-fin.RM·November 1, 2024

Debiasing Alternative Data for Credit Underwriting Using Causal Inference

Chris Lam

PDF

Open Access

TL;DR

This paper introduces a causal inference-based method to debias alternative data in credit scoring, aiming to improve model fairness and accuracy across racial groups while ensuring legal compliance.

Contribution

It presents a novel approach combining causal inference with supervised learning to debias alternative data for credit underwriting, with theoretical nondiscrimination guarantees.

Findings

01

Improved model accuracy across racial groups.

02

Theoretically robust nondiscrimination guarantees.

03

Effective debiasing of alternative data in credit scoring.

Abstract

Alternative data provides valuable insights for lenders to evaluate a borrower's creditworthiness, which could help expand credit access to underserved groups and lower costs for borrowers. But some forms of alternative data have historically been excluded from credit underwriting because it could act as an illegal proxy for a protected class like race or gender, causing redlining. We propose a method for applying causal inference to a supervised machine learning model to debias alternative data so that it might be used for credit underwriting. We demonstrate how our algorithm can be used against a public credit dataset to improve model accuracy across different racial groups, while providing theoretically robust nondiscrimination guarantees.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsItaly: Economic History and Contemporary Issues · Credit Risk and Financial Regulations · Banking stability, regulation, efficiency

MethodsCausal inference