Statistical Inference for Maximin Effects: Identifying Stable Associations across Multiple Studies
Zijian Guo

TL;DR
This paper develops a statistical framework for identifying stable, generalizable associations across multiple high-dimensional datasets by inferring maximin effects, with a novel sampling method for valid confidence intervals, demonstrated on genetic data.
Contribution
It introduces a new inference method for maximin effects in multi-source high-dimensional data, including a sampling technique for confidence intervals with parametric length.
Findings
Genetic variants with significant maximin effects are generalizable to new environments.
The proposed confidence intervals are valid and have parametric length.
The method effectively identifies stable associations across diverse populations.
Abstract
Integrative analysis of data from multiple sources is critical to making generalizable discoveries. Associations that are consistently observed across multiple source populations are more likely to be generalized to target populations with possible distributional shifts. In this paper, we model the heterogeneous multi-source data with multiple high-dimensional regressions and make inferences for the maximin effect (Meinshausen, B{\"u}hlmann, AoS, 43(4), 1801--1830). The maximin effect provides a measure of stable associations across multi-source data. A significant maximin effect indicates that a variable has commonly shared effects across multiple source populations, and these shared effects may be generalized to a broader set of target populations. There are challenges associated with inferring maximin effects because its point estimator can have a non-standard limiting distribution.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Genetic and phenotypic traits in livestock · Genetic Mapping and Diversity in Plants and Animals
