A transformer-based model for default prediction in mid-cap corporate markets
Kamesh Korangi, Christophe Mues, Cristi\'an Bravo

TL;DR
This paper introduces a transformer-based deep learning model for predicting default probabilities of mid-cap companies, leveraging multi-source time-series data, and demonstrates significant performance improvements over traditional methods.
Contribution
It adapts transformer models for credit risk prediction, introduces a multi-channel architecture with differential training, and provides interpretability tools like attention heat maps and Shapley importance rankings.
Findings
13% improvement in AUC over traditional models
Effective use of multi-source data for default prediction
Enhanced interpretability with attention heat maps and Shapley analysis
Abstract
In this paper, we study mid-cap companies, i.e. publicly traded companies with less than US $10 billion in market capitalisation. Using a large dataset of US mid-cap companies observed over 30 years, we look to predict the default probability term structure over the medium term and understand which data sources (i.e. fundamental, market or pricing data) contribute most to the default risk. Whereas existing methods typically require that data from different time periods are first aggregated and turned into cross-sectional features, we frame the problem as a multi-label time-series classification problem. We adapt transformer models, a state-of-the-art deep learning model emanating from the natural language processing domain, to the credit risk modelling setting. We also interpret the predictions of these models using attention heat maps. To optimise the model further, we present a custom…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
