Multi-Industry Simplex : A Probabilistic Extension of GICS
Maksim Papenkov, Chris Meredith, Claire Noel, Jai Padalkar, and Temple Hendrickson, Daniel Nitiutomo, Thomas Farrell

TL;DR
This paper introduces Multi-Industry Simplex, a probabilistic model that assigns firms to multiple industries based on business descriptions, improving industry classification for diversified conglomerates and asset management applications.
Contribution
The paper presents a novel probabilistic industry classification model using topic modeling, allowing firms to be associated with multiple industries with relevance probabilities.
Findings
Demonstrates the model's effectiveness in thematic portfolios
Shows improved classification for diversified firms
Provides interpretability through relevance probabilities
Abstract
Accurate industry classification is a critical tool for many asset management applications. While the current industry gold-standard GICS (Global Industry Classification Standard) has proven to be reliable and robust in many settings, it has limitations that cannot be ignored. Fundamentally, GICS is a single-industry model, in which every firm is assigned to exactly one group - regardless of how diversified that firm may be. This approach breaks down for large conglomerates like Amazon, which have risk exposure spread out across multiple sectors. We attempt to overcome these limitations by developing MIS (Multi-Industry Simplex), a probabilistic model that can flexibly assign a firm to as many industries as can be supported by the data. In particular, we utilize topic modeling, an natural language processing approach that utilizes business descriptions to extract and identify…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence
