Visualizing Dependence in High-Dimensional Data: An Application to S&P 500 Constituent Data
Marius Hofert, Wayne Oldford

TL;DR
This paper introduces zenpaths and zenplots as new visualization tools to explore dependence structures in high-dimensional data, demonstrated through analysis of S&P 500 constituent data during the 2007-2008 financial crisis.
Contribution
The paper presents novel visualization methods, zenpaths and zenplots, for detecting and exploring dependence in high-dimensional datasets, with implementation in an R package.
Findings
Effective visualization of tail dependence in financial data
Identification of sector-specific dependence patterns during crisis
Tools applicable to finance, insurance, and risk management
Abstract
The notion of a zenpath and a zenplot is introduced to search and detect dependence in high-dimensional data for model building and statistical inference. By using any measure of dependence between two random variables (such as correlation, Spearman's rho, Kendall's tau, tail dependence etc.), a zenpath can construct paths through pairs of variables in different ways, which can then be laid out and displayed by a zenplot. The approach is illustrated by investigating tail dependence and model fit in constituent data of the S&P 500 during the financial crisis of 2007-2008. The corresponding Global Industry Classification Standard (GICS) sector information is also addressed. Zenpaths and zenplots are useful tools for exploring dependence in high-dimensional data, for example, from the realm of finance, insurance and quantitative risk management. All presented algorithms are implemented…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Time Series Analysis
