Sequence Annotation with HMMs: New Problems and Their Complexity
Michal N\'an\'asi, Tom\'a\v{s} Vina\v{r}, Bro\v{n}a Brejov\'a

TL;DR
This paper explores the computational complexity of three HMM decoding criteria, proving their NP-hardness, and demonstrates their practical usefulness in HIV recombination detection.
Contribution
It introduces and analyzes new HMM decoding criteria, establishing their NP-hardness and showing their application in biological sequence analysis.
Findings
Proved NP-hardness of three HMM decoding criteria
Demonstrated effectiveness in HIV recombination detection
Provided insights into sequence annotation complexity
Abstract
Hidden Markov models (HMMs) and their variants were successfully used for several sequence annotation tasks. Traditionally, inference with HMMs is done using the Viterbi and posterior decoding algorithms. However, recently a variety of different optimization criteria and associated computational problems were proposed. In this paper, we consider three HMM decoding criteria and prove their NP hardness. These criteria consider the set of states used to generate a certain sequence, but abstract from the exact locations of regions emitted by individual states. We also illustrate experimentally that these criteria are useful for HIV recombination detection.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Machine Learning in Bioinformatics · RNA and protein synthesis mechanisms
