Location and association measures for interval-valued data based on Mallows' distance
M. Ros\'ario Oliveira, Diogo Pinheiro, and Lina Oliveira

TL;DR
This paper extends measures of location and association for interval-valued data using Mallows' distance, providing explicit formulas and a new symbolic covariance matrix that captures dependencies between centers and ranges.
Contribution
It generalizes existing measures to any absolutely continuous distribution with finite second moment and derives explicit formulas for Mallows' distance in multidimensional interval spaces.
Findings
Explicit formulas for Mallows' distance in p-dimensional interval spaces.
A new symbolic covariance matrix capturing dependence between centers and ranges.
Empirical validation on real-world datasets demonstrating methodology flexibility.
Abstract
The growing demand to analyse large and complex datasets has spurred the development of Symbolic Data Analysis as a promising approach to address contemporary data challenges. Amongst these, interval-valued data introduces new theoretical and methodological questions that remain open. In this paper, we generalise measures of location and association for interval-valued random variables using Mallows' distance. Departing from restrictive assumptions such as uniform distributions over microdata, our proposal extends the barycentre approach to any absolutely continuous distribution with finite second moment. A key contribution is the derivation of explicit formulas for Mallows' distance in p-dimensional interval spaces. These formulas decompose into components for centres, ranges, and a novel cross-term that captures their interaction. This decomposition leads to a new theoretical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFuzzy Systems and Optimization
