Multi-sample Comparison Using Spatial Signs for Infinite Dimensional Data
Joydeep Chowdhury, Probal Chaudhuri

TL;DR
This paper introduces a novel spatial signs-based test for analyzing variance in infinite-dimensional data, such as functional data, demonstrating superior performance over mean-based tests in various models and real datasets.
Contribution
The paper develops a new spatial signs-based test for functional data analysis, with asymptotic, bootstrap, and permutation implementations, outperforming existing mean-based tests in diverse scenarios.
Findings
The test outperforms mean-based tests in non-Gaussian models with heavy tails or skewness.
The test performs well even in some Gaussian models.
It shows strong results on real datasets including weather, chemical, and orthotic measurements.
Abstract
We consider an analysis of variance type problem, where the sample observations are random elements in an infinite dimensional space. This scenario covers the case, where the observations are random functions. For such a problem, we propose a test based on spatial signs. We develop an asymptotic implementation as well as a bootstrap implementation and a permutation implementation of this test and investigate their size and power properties. We compare the performance of our test with that of several mean based tests of analysis of variance for functional data studied in the literature. Interestingly, our test not only outperforms the mean based tests in several non-Gaussian models with heavy tails or skewed distributions, but in some Gaussian models also. Further, we also compare the performance of our test with the mean based tests in several models involving contaminated probability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoil Geostatistics and Mapping · Bayesian Methods and Mixture Models · Data-Driven Disease Surveillance
