A Novel Combinatorial Method for Estimating Transcript Expression with RNA-Seq: Bounding the Number of Paths
Alexandru I. Tomescu, Anna Kuosmanen, Romeo Rizzi, and Veli M\"akinen

TL;DR
This paper introduces Traph, a new combinatorial algorithm for estimating transcript expression from RNA-Seq data, which efficiently handles the NP-hard problem of bounding the number of paths explaining the splicing graph.
Contribution
It presents a fast dynamic programming algorithm for a bounded-paths transcript estimation problem and implements heuristics that outperform existing tools on real and simulated data.
Findings
Traph achieves better performance than Cufflinks, IsoLasso, and SLIDE.
The algorithm effectively handles the NP-hard problem with practical heuristics.
Traph is available as an open-source tool for transcript quantification.
Abstract
RNA-Seq technology offers new high-throughput ways for transcript identification and quantification based on short reads, and has recently attracted great interest. The problem is usually modeled by a weighted splicing graph whose nodes stand for exons and whose edges stand for split alignments to the exons. The task consists of finding a number of paths, together with their expression levels, which optimally explain the coverages of the graph under various fitness functions, such least sum of squares. In (Tomescu et al. RECOMB-seq 2013) we showed that under general fitness functions, if we allow a polynomially bounded number of paths in an optimal solution, this problem can be solved in polynomial time by a reduction to a min-cost flow program. In this paper we further refine this problem by asking for a bounded number k of paths that optimally explain the splicing graph. This problem…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · RNA and protein synthesis mechanisms · Molecular Biology Techniques and Applications
