Exact Multivariate Tests - A New Effective Principle of Controlled Model Choice
Juergen Laeuter, Maciej Rosolowski, Ekkehard Glimm

TL;DR
This paper introduces a new principle for controlled model choice in high-dimensional multivariate testing, improving error control by reinterpreting hypotheses on distributional sphericity, with applications in bioinformatics.
Contribution
It offers a novel mathematical interpretation of multivariate tests that allows for better error control in model selection without strict restrictions.
Findings
Enhanced control of the first kind error in multivariate tests
Applicable to all linear multivariate designs
Illustrated with gene set selection in bioinformatics
Abstract
High-dimensional tests are applied to find relevant sets of variables and relevant models. If variables are selected by analyzing the sums of products matrices and a corresponding mean-value test is performed, there is the danger that the nominal error of first kind is exceeded. In the paper, well-known multivariate tests receive a new mathematical interpretation such that the error of first kind of the combined testing and selecting procedure can more easily be kept. The null hypotheses on mean values are replaced by hypotheses on distributional sphericity of the individual score responses. Thus, model choice is possible without too strong restrictions. The method is presented for all linear multivariate designs. It is illustrated by an example from bioinformatics: The selection of gene sets for the comparison of groups of patients suffering from B-cell lymphomas.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimal Experimental Design Methods · Advanced Statistical Modeling Techniques · Genetics and Plant Breeding
