Optimal Estimation of Simultaneous Signals Using Absolute Inner Product with Applications to Integrative Genomics
Rong Ma, T. Tony Cai, Hongzhe Li

TL;DR
This paper introduces a minimax rate-optimal estimator for the absolute inner product of Gaussian mean vectors, enhancing integrative genomics analysis by identifying genes linked to diseases through a novel $T$-score.
Contribution
It proposes a new estimator for the $T$-score based on approximation theory, which is shown to be minimax rate-optimal and adaptive across various parameter spaces.
Findings
Estimator outperforms existing methods in simulations
Applied to heart failure genomics data, identifying potential causal genes
Revealed biological pathways associated with heart failure
Abstract
Integrating the summary statistics from genome-wide association study (\textsc{gwas}) and expression quantitative trait loci (e\textsc{qtl}) data provides a powerful way of identifying the genes whose expression levels are potentially associated with complex diseases. A parameter called -score that quantifies the genetic overlap between a gene and the disease phenotype based on the summary statistics is introduced based on the mean values of two Gaussian sequences. Specifically, given two independent samples and , the -score is defined as , a non-smooth functional, which characterizes the amount of shared signals between two absolute normal mean vectors and . Using approximation theory, estimators are constructed and shown to be minimax rate-optimal and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
