A powerful transformation of quantitative responses for biobank-scale association studies
Yaowu Liu, Tianying Wang

TL;DR
This paper introduces a new transformation method for linear regression in large-scale genetic studies, improving power to detect weak signals while ensuring computational efficiency and error control.
Contribution
It proposes a novel error density-based transformation that is both powerful for hypothesis testing and scalable for biobank-scale data analysis.
Findings
Enhanced statistical power over existing methods.
Maintains strict control of type I error rates.
Validated on UK Biobank spirometry data.
Abstract
In linear regression models with non-Gaussian errors, transformations of the response variable are widely used in a broad range of applications. Motivated by various genetic association studies, transformation methods for hypothesis testing have received substantial interest. In recent years, the rise of biobank-scale genetic studies, which feature a vast number of participants that could be around half a million, spurred the need for new transformation methods that are both powerful for detecting weak genetic signals and computationally efficient for large-scale data. In this work, we propose a novel transformation method that leverages the information of the error density. This transformation leads to locally most powerful tests and therefore has strong power for detecting weak signals. To make the computation scalable to biobank-scale studies, we harnessed the nature of weak genetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Statistical Methods and Inference · Genetic and phenotypic traits in livestock
