A Pseudo-Metric between Probability Distributions based on Depth-Trimmed Regions
Guillaume Staerman, Pavlo Mozharovskyi, Pierre Colombo, St\'ephan, Cl\'emen\c{c}on, Florence d'Alch\'e-Buc

TL;DR
This paper introduces a new pseudo-metric for probability distributions in Euclidean space using depth-trimmed regions, offering robustness, invariance properties, and an efficient approximation method demonstrated through numerical experiments.
Contribution
It proposes a novel depth-based pseudo-metric between probability distributions, extending univariate quantiles to multivariate spaces with robust and computationally efficient features.
Findings
The pseudo-metric exhibits good invariance and robustness properties.
An efficient linear-time approximation method is developed.
Numerical experiments validate the effectiveness of the proposed approach.
Abstract
The design of a metric between probability distributions is a longstanding problem motivated by numerous applications in Machine Learning. Focusing on continuous probability distributions on the Euclidean space , we introduce a novel pseudo-metric between probability distributions by leveraging the extension of univariate quantiles to multivariate spaces. Data depth is a nonparametric statistical tool that measures the centrality of any element with respect to (w.r.t.) a probability distribution or a data set. It is a natural median-oriented extension of the cumulative distribution function (cdf) to the multivariate case. Thus, its upper-level sets -- the depth-trimmed regions -- give rise to a definition of multivariate quantiles. The new pseudo-metric relies on the average of the Hausdorff distance between the depth-based quantile regions w.r.t. each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Advanced Statistical Methods and Models · Probabilistic and Robust Engineering Design
