Two-sample tests for repeated measurements of histogram objects with applications to wearable device data
Jingru Zhang, Kathleen R. Merikangas, Hongzhe Li, Haochang Shou

TL;DR
This paper introduces new non-parametric graph-based two-sample tests designed for complex object data with repeated measures, such as histograms from wearable device data, demonstrating improved power and insights into biological variability.
Contribution
The paper develops novel non-parametric graph-based two-sample tests for repeated measures of complex object data, with derivation of asymptotic null distributions and demonstrated power improvements.
Findings
Tests outperform existing methods in simulations.
Methods control type I error in finite samples.
Application reveals variability in physical activity distributions.
Abstract
Repeated observations have become increasingly common in biomedical research and longitudinal studies. For instance, wearable sensor devices are deployed to continuously track physiological and biological signals from each individual over multiple days. It remains of great interest to appropriately evaluate how the daily distribution of biosignals might differ across disease groups and demographics. Hence these data could be formulated as multivariate complex object data such as probability densities, histograms, and observations on a tree. Traditional statistical methods would often fail to apply as they are sampled from an arbitrary non-Euclidean metric space. In this paper, we propose novel non-parametric graph-based two-sample tests for object data with repeated measures. A set of test statistics are proposed to capture various possible alternatives. We derive their asymptotic null…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNutritional Studies and Diet · Data-Driven Disease Surveillance · Statistical Methods and Inference
