Central subspace data depth
Giacomo Francisci, Claudio Agostinelli

TL;DR
This paper introduces a new class of data depth measures that identify and order data points relative to a subspace of symmetry, extending traditional point-centered depths to subspace-centered depths for multivariate data analysis.
Contribution
It proposes a general framework for central subspace data depths, explores their properties, and demonstrates their application in data analysis and fraud detection.
Findings
Depths attain maximum at subspace of symmetry
Asymptotic convergence of sample depths established
Application to data fraud detection illustrates effectiveness
Abstract
Statistical data depth plays an important role in the analysis of multivariate data sets. The main outcome is a center-outward ordering of the observations that can be used both to highlight features of the underlying distribution of the data and as input to further statistical analysis. An important property of data depth is related to symmetric distributions as the point with the highest depth value, the center, coincides with the point of symmetry. However, there are applications in which it is more natural to consider symmetry with respect to a subspace of a certain dimension rather than to a point, i.e. a subspace of dimension zero. We provide a general framework to construct statistical data depths which attain maximum value in a subspace, providing a center-outward ordering from that subspace. We refer to these data depths as central subspace data depths. Moreover, if the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Imbalanced Data Classification Techniques · Anomaly Detection Techniques and Applications
