A transformer-based model for default prediction in mid-cap corporate   markets

Kamesh Korangi; Christophe Mues; Cristi\'an Bravo

arXiv:2111.09902·q-fin.GN·May 13, 2024

A transformer-based model for default prediction in mid-cap corporate markets

Kamesh Korangi, Christophe Mues, Cristi\'an Bravo

PDF

TL;DR

This paper introduces a transformer-based deep learning model for predicting default probabilities of mid-cap companies, leveraging multi-source time-series data, and demonstrates significant performance improvements over traditional methods.

Contribution

It adapts transformer models for credit risk prediction, introduces a multi-channel architecture with differential training, and provides interpretability tools like attention heat maps and Shapley importance rankings.

Findings

01

13% improvement in AUC over traditional models

02

Effective use of multi-source data for default prediction

03

Enhanced interpretability with attention heat maps and Shapley analysis

Abstract

In this paper, we study mid-cap companies, i.e. publicly traded companies with less than US $10 billion in market capitalisation. Using a large dataset of US mid-cap companies observed over 30 years, we look to predict the default probability term structure over the medium term and understand which data sources (i.e. fundamental, market or pricing data) contribute most to the default risk. Whereas existing methods typically require that data from different time periods are first aggregated and turned into cross-sectional features, we frame the problem as a multi-label time-series classification problem. We adapt transformer models, a state-of-the-art deep learning model emanating from the natural language processing domain, to the credit risk modelling setting. We also interpret the predictions of these models using attention heat maps. To optimise the model further, we present a custom…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.