TL;DR
This paper introduces a generalized similarity U (GSU) test for assessing associations between complex genetic and phenotypic data, demonstrating its theoretical robustness and practical effectiveness in genome-wide studies.
Contribution
The paper proposes a novel non-parametric similarity-based test, GSU, for complex object association testing, with theoretical analysis and application to sequencing data.
Findings
GSU outperforms existing methods in power and robustness.
Application to ADNI data identified three genes linked to imaging phenotypes.
Developed a C++ package for GSU analysis of whole genome sequencing data.
Abstract
Second generation sequencing technologies are being increasingly used for genetic association studies, where the main research interest is to identify sets of genetic variants that contribute to various phenotype. The phenotype can be univariate disease status, multivariate responses and even high-dimensional outcomes. Considering the genotype and phenotype as two complex objects, this also poses a general statistical problem of testing association between complex objects. We here proposed a similarity-based test, generalized similarity U (GSU), that can test the association between complex objects. We first studied the theoretical properties of the test in a general setting and then focused on the application of the test to sequencing association studies. Based on theoretical analysis, we proposed to use Laplacian kernel based similarity for GSU to boost power and enhance robustness.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
