Doubly stochastic continuous-time hidden Markov approach for analyzing genome tiling arrays
W. Evan Johnson, X. Shirley Liu, Jun S. Liu

TL;DR
This paper introduces a novel doubly stochastic continuous-time hidden Markov model tailored for analyzing genome tiling array data, effectively addressing artifacts and considering genomic distances to improve transcript and protein binding site detection.
Contribution
The proposed model uniquely incorporates genomic distances and robustness to artifacts, advancing tiling array analysis methods beyond existing approaches.
Findings
High accuracy in transcript and binding site detection
Robustness to cross-hybridization and nonresponsive probes
Effective on single samples without control experiments
Abstract
Microarrays have been developed that tile the entire nonrepetitive genomes of many different organisms, allowing for the unbiased mapping of active transcription regions or protein binding sites across the entire genome. These tiling array experiments produce massive correlated data sets that have many experimental artifacts, presenting many challenges to researchers that require innovative analysis methods and efficient computational algorithms. This paper presents a doubly stochastic latent variable analysis method for transcript discovery and protein binding region localization using tiling array data. This model is unique in that it considers actual genomic distance between probes. Additionally, the model is designed to be robust to cross-hybridized and nonresponsive probes, which can often lead to false-positive results in microarray experiments. We apply our model to a transcript…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
