Variability of qualitative variables: A Hilbert space formulation
Juan D. Botero, Leonardo A. Pach\'on

TL;DR
This paper introduces a novel Hilbert space formalism for analyzing qualitative variable diversity, enabling robust, interpretable category reduction and handling simultaneous categories in data analysis.
Contribution
It presents a new mathematical framework using Hilbert spaces for diversity measures, including strategies for category reduction and robustness against uncertainty.
Findings
Formalism effectively addresses category reduction issues.
Normalization of diversity measures improves interpretability.
Framework accommodates simultaneous categories in data analysis.
Abstract
A new formalism to express and operate on diversity measures of qualitative variables, built in a Hilbert space, is presented. The abstract character of the Hilbert space naturally incorporates the equivalence between qualitative variables and is utilized here to (i) represent the binary character of answers to categories and (ii) introduce a new criterium for choosing between different measures of diversity, namely, robustness against uncertainty. The full potential of the formulation on a Hilbert space comes to play when addressing the reduction of categories problem, a common problem in data analysis. The present formalism solves the problem by incorporating strategies inspired by mathematical and physical statistics, specifically, it makes use of the concept of partial trace. In solving this problem, it is shown that properly normalizing the diversity measures is instrumental to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGender Diversity and Inequality
