TL;DR
This paper demonstrates how unsupervised machine learning can objectively classify US stocks into canonical sectors based on their return data, revealing the evolution of firms' sector participation over time.
Contribution
It introduces a data-driven method for sector classification that replaces manual, expert-based systems, using low-dimensional structures in stock return data.
Findings
Unsupervised learning identifies canonical market sectors from stock returns.
The method reveals how firms' sector participation weights change over time.
The approach offers a more objective sector classification compared to traditional systems.
Abstract
A classification of companies into sectors of the economy is important for macroeconomic analysis and for investments into the sector-specific financial indices and exchange traded funds (ETFs). Major industrial classification systems and financial indices have historically been based on expert opinion and developed manually. Here we show how unsupervised machine learning can provide a more objective and comprehensive broad-level sector decomposition of stocks. An emergent low-dimensional structure in the space of historical stock price returns automatically identifies "canonical sectors" in the market, and assigns every stock a participation weight into these sectors. Furthermore, by analyzing data from different periods, we show how these weights for listed firms have evolved over time.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
