Genetic variant selection: learning across traits and sites

Laurel Stell; Chiara Sabatti

arXiv:1504.00946·stat.ME·April 6, 2016

Genetic variant selection: learning across traits and sites

Laurel Stell, Chiara Sabatti

PDF

TL;DR

This paper introduces Bayesian methods with novel priors for selecting genetic variants across traits and sites, improving prioritization in resequencing studies by leveraging correlations and shared information.

Contribution

It proposes two new prior distributions within a Bayesian multivariate linear regression framework to enhance variant prioritization by borrowing evidence across phenotypes and mutations.

Findings

01

Simulations show improved variant detection accuracy.

02

Re-analysis of sequencing data demonstrates practical benefits.

03

Bayesian approach effectively integrates multiple sources of evidence.

Abstract

We consider resequencing studies of associated loci and the problem of prioritizing sequence variants for functional follow-up. Working within the multivariate linear regression framework helps us to account for correlation across variants, and adopting a Bayesian approach naturally leads to posterior probabilities that incorporate all information about the variants' function. We describe two novel prior distributions that facilitate learning the role of each variant by borrowing evidence across phenotypes and across mutations in the same gene. We illustrate their potential advantages with simulations and re-analyzing a dataset of sequencing variants.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.