Doubly stochastic continuous-time hidden Markov approach for analyzing   genome tiling arrays

W. Evan Johnson; X. Shirley Liu; Jun S. Liu

arXiv:0910.2090·stat.AP·October 13, 2009

Doubly stochastic continuous-time hidden Markov approach for analyzing genome tiling arrays

W. Evan Johnson, X. Shirley Liu, Jun S. Liu

PDF

TL;DR

This paper introduces a novel doubly stochastic continuous-time hidden Markov model tailored for analyzing genome tiling array data, effectively addressing artifacts and considering genomic distances to improve transcript and protein binding site detection.

Contribution

The proposed model uniquely incorporates genomic distances and robustness to artifacts, advancing tiling array analysis methods beyond existing approaches.

Findings

01

High accuracy in transcript and binding site detection

02

Robustness to cross-hybridization and nonresponsive probes

03

Effective on single samples without control experiments

Abstract

Microarrays have been developed that tile the entire nonrepetitive genomes of many different organisms, allowing for the unbiased mapping of active transcription regions or protein binding sites across the entire genome. These tiling array experiments produce massive correlated data sets that have many experimental artifacts, presenting many challenges to researchers that require innovative analysis methods and efficient computational algorithms. This paper presents a doubly stochastic latent variable analysis method for transcript discovery and protein binding region localization using tiling array data. This model is unique in that it considers actual genomic distance between probes. Additionally, the model is designed to be robust to cross-hybridized and nonresponsive probes, which can often lead to false-positive results in microarray experiments. We apply our model to a transcript…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.