Analyzing the Fine Structure of Distributions
Michael C. Thrun, Tino Gehlert, Alfred Ultsch

TL;DR
This paper introduces the mirrored density plot (MD plot), a new visualization tool for analyzing univariate distributions that performs well without parameter tuning, especially in complex cases like multimodal or skewed data.
Contribution
The paper proposes the MD plot, a novel visualization method that effectively reveals distribution structures without requiring density estimation parameter adjustments.
Findings
MD plot outperforms traditional visualization tools in identifying distribution structures.
MD plot effectively visualizes bimodal, skewed, and clipped distributions.
In financial data analysis, MD plots uniquely identified distribution structures where statistical tests struggled.
Abstract
One aim of data mining is the identification of interesting structures in data. For better analytical results, the basic properties of an empirical distribution, such as skewness and eventual clipping, i.e. hard limits in value ranges, need to be assessed. Of particular interest is the question of whether the data originate from one process or contain subsets related to different states of the data producing process. Data visualization tools should deliver a clear picture of the univariate probability density distribution (PDF) for each feature. Visualization tools for PDFs typically use kernel density estimates and include both the classical histogram, as well as the modern tools like ridgeline plots, bean plots and violin plots. If density estimation parameters remain in a default setting, conventional methods pose several problems when visualizing the PDF of uniform, multimodal,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
