Optimal Rescaling and the Mahalanobis Distance
Przemys{\l}aw Spurek, Jacek Tabor

TL;DR
This paper generalizes the Mahalanobis distance by introducing a cross-entropy-based measure to find an optimal affine rescaling of data, improving upon classical normalization methods with a new formula for the transformation.
Contribution
It introduces a cross-entropy framework to optimize data rescaling, extending the Mahalanobis approach with a new formula for the covariance matrix that accounts for additional conditions.
Findings
Provides a new formula for optimal affine rescaling based on cross-entropy.
Shows that the classical Mahalanobis transformation is optimal when the origin is at the mean.
Generalizes the Mahalanobis distance to include additional constraints.
Abstract
One of the basic problems in data analysis lies in choosing the optimal rescaling (change of coordinate system) to study properties of a given data-set . The classical Mahalanobis approach has its basis in the classical normalization/rescaling formula , where denotes the mean of and the covariance matrix . Based on the cross-entropy we generalize this approach and define the parameter which measures the fit of a given affine rescaling of compared to the Mahalanobis one. This allows in particular to find an optimal change of coordinate system which satisfies some additional conditions. In particular we show that in the case when we put origin of coordinate system in the optimal choice is given by the transformation , where $$…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Statistical Mechanics and Entropy · Neural Networks and Applications
