Combining haplotypers

Matti K\"a\"ari\"ainen; Niels Landwehr; Sampsa Lappalainen; Taneli; Mielik\"ainen

arXiv:0710.5116·cs.LG·October 29, 2007·5 cites

Combining haplotypers

Matti K\"a\"ari\"ainen, Niels Landwehr, Sampsa Lappalainen, Taneli, Mielik\"ainen

PDF

Open Access

TL;DR

This paper explores combining multiple haplotype reconstruction methods to improve accuracy and robustness in gene mapping, addressing the challenge of selecting the best method for different population samples.

Contribution

It introduces several techniques for combining haplotype predictions and demonstrates their effectiveness on real data, outperforming individual methods.

Findings

01

Combined methods often outperform single methods in accuracy.

02

Techniques provide robustness against outliers.

03

Combining methods helps circumvent method selection issues.

Abstract

Statistically resolving the underlying haplotype pair for a genotype measurement is an important intermediate step in gene mapping studies, and has received much attention recently. Consequently, a variety of methods for this problem have been developed. Different methods employ different statistical models, and thus implicitly encode different assumptions about the nature of the underlying haplotype structure. Depending on the population sample in question, their relative performance can vary greatly, and it is unclear which method to choose for a particular sample. Instead of choosing a single method, we explore combining predictions returned by different methods in a principled way, and thereby circumvent the problem of method selection. We propose several techniques for combining haplotype reconstructions and analyze their computational properties. In an experimental study on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGene expression and cancer classification · Algorithms and Data Compression · Bioinformatics and Genomic Networks