Leveraging Multiple Online Sources for Accurate Income Verification
Chirag Mahapatra, Kedar Bellare

TL;DR
This paper introduces a combined deep learning and feature-engineering approach for accurate income verification using limited identity data from online sources, reducing errors significantly over existing methods.
Contribution
It presents a novel hybrid model that leverages online income records and deep neural networks for improved income verification accuracy.
Findings
Achieved a 3-6% reduction in verification error compared to baselines.
Validated the model on both simulated and real-world datasets.
Demonstrated the necessity of combining deep learning with hand-engineered features.
Abstract
Income verification is the problem of validating a person's stated income given basic identity information such as name, location, job title and employer. It is widely used in the context of mortgage lending, rental applications and other financial risk models. However, the current processes surrounding verification involve significant human effort and document gathering which can be both time-consuming and expensive. In this paper, we propose a novel model for verifying an individual's income given very limited identity information typically available in loan applications. Our model is a combination of a deep neural network and hand-engineered features. The hand engineered features are based upon matching the input information against income records extracted automatically from various publicly available online sources (e.g. payscale.com, H-1B filings, government employee salaries). We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Literacy, Pension, Retirement Analysis · FinTech, Crowdfunding, Digital Finance
