Supervised similarity learning for corporate bonds using Random Forest proximities
Jerinsh Jeyapaulraj, Dhruv Desai, Peter Chu, Dhagash Mehta, Stefano, Pasquali, Philip Sommer

TL;DR
This paper introduces a supervised similarity learning framework for corporate bonds using Random Forest proximities, addressing challenges in noisy financial data and enabling more effective local and global similarity measures for portfolio management.
Contribution
It demonstrates that Random Forest can be effectively used as a similarity metric learning tool and proposes a novel similarity evaluation metric for corporate bonds.
Findings
RF outperforms other methods in similarity tasks
The framework captures both local and global similarities
A new metric for evaluating similarity is proposed
Abstract
Financial literature consists of ample research on similarity and comparison of financial assets and securities such as stocks, bonds, mutual funds, etc. However, going beyond correlations or aggregate statistics has been arduous since financial datasets are noisy, lack useful features, have missing data and often lack ground truth or annotated labels. However, though similarity extrapolated from these traditional models heuristically may work well on an aggregate level, such as risk management when looking at large portfolios, they often fail when used for portfolio construction and trading which require a local and dynamic measure of similarity on top of global measure. In this paper we propose a supervised similarity framework for corporate bonds which allows for inference based on both local and global measures. From a machine learning perspective, this paper emphasis that random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Machine Learning in Healthcare · Complex Systems and Time Series Analysis
