Inference for linear functionals of high-dimensional longitudinal proteomics data using generalized estimating equations
Lu Xia, Ali Shojaie

TL;DR
This paper introduces a new statistical inference method for high-dimensional longitudinal proteomics data, enabling accurate confidence intervals for protein associations with COVID-19 severity, addressing challenges of correlated high-dimensional data.
Contribution
It proposes a novel inference procedure for linear functionals of high-dimensional regression coefficients using generalized estimating equations, with a data-driven tuning parameter selection method.
Findings
Method provides asymptotically normal estimators under mild conditions.
Accurate confidence intervals for protein-COVID associations demonstrated.
Robust performance in simulations, especially in bias and coverage.
Abstract
Regression analysis of correlated data, where multiple correlated responses are recorded on the same unit, is ubiquitous in many scientific areas. With the advent of new technologies, in particular high-throughput omics profiling assays, such correlated data increasingly consist of large number of variables compared with the available sample size. Motivated by recent longitudinal proteomics studies of COVID-19, we propose a novel inference procedure for linear functionals of high-dimensional regression coefficients in generalized estimating equations, which are widely used to analyze correlated data. Our estimator for this more general inferential target, obtained via constructing projected estimating equations, is shown to be asymptotically normally distributed under mild regularity conditions. We also introduce a data-driven cross-validation procedure to select the tuning parameter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Gene expression and cancer classification · SARS-CoV-2 detection and testing
