Credit Risk Estimation with Non-Financial Features: Evidence from a Synthetic Istanbul Dataset
Atalay Denknalbant, Emre Sezdi, Zeki Furkan Kutlu

TL;DR
This paper introduces a synthetic Istanbul dataset and demonstrates that incorporating behavioral features significantly improves credit risk prediction for underbanked populations, offering a transparent approach for fair lending.
Contribution
It provides an open synthetic dataset, a reproducible modeling pipeline, and empirical evidence that behavioral attributes enhance credit risk estimation for unbanked individuals.
Findings
Alternative features improve AUC by 1.3 percentage points.
Full models achieve F1 score of 0.95, up from 0.84.
Behavioral data approaches bureau-level discrimination power.
Abstract
Financial exclusion constrains entrepreneurship, increases income volatility, and widens wealth gaps. Underbanked consumers in Istanbul often have no bureau file because their earnings and payments flow through informal channels. To study how such borrowers can be evaluated we create a synthetic dataset of one hundred thousand Istanbul residents that reproduces first quarter 2025 T\"U\.IK (TURKSTAT) census marginals and telecom usage patterns. Retrieval augmented generation feeds these public statistics into the OpenAI o3 model, which synthesises realistic yet private records. Each profile contains seven socio demographic variables and nine alternative attributes that describe phone specifications, online shopping rhythm, subscription spend, car ownership, monthly rent, and a credit card flag. To test the impact of the alternative financial data CatBoost, LightGBM, and XGBoost are each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMicrofinance and Financial Inclusion · Financial Literacy, Pension, Retirement Analysis · FinTech, Crowdfunding, Digital Finance
