Decomposition of variance in terms of conditional means
Alessandro Figa' Talamanca, Angelo Guerriero, Alberto Leone, Gian, Piero Mignoli, Enrico Rogora

TL;DR
This paper introduces a new method for analyzing variance by decomposing it into orthogonal components based on conditional means related to qualitative variables, demonstrated on educational data.
Contribution
It proposes a novel variance decomposition approach using differences of conditional means, complementing existing techniques and including an algorithm for natural ordering of qualitative factors.
Findings
Applied to student exam scores, revealing the influence of question correctness.
Analyzed university graduation delays with respect to demographic and educational factors.
Demonstrated the method's usefulness in understanding variable interdependence.
Abstract
We test against two different sets of data an apparently new approach to the analysis of the variance of a numerical variable which depends on qualitative characters. We suggest that this approach be used to complement other existing techniques to study the interdependence of the variables involved. According to our method the variance is expressed as a sum of orthogonal components, obtained as differences of conditional means, with respect to the qualitative characters. The resulting expression for the variance depends on the ordering in which the characters are considered. We suggest an algorithm which leads to an ordering which is deemed natural. The first set of data concerns the score achieved by a population of students, on an entrance examination, based on a multiple choice test with 30 questions. In this case the qualitative characters are dyadic and correspond to correct or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference · Multi-Criteria Decision Making
