Evaluating Optimal Safe Flows Decomposition for RNA Assembly
Bashar Ahmed, Siddharth Singh Rana, Ujjwal, Shahbaz Khan

TL;DR
This paper evaluates optimal flow decomposition algorithms for RNA assembly, demonstrating significant improvements in computational efficiency and output size, while also proposing heuristics to further enhance practical performance on large graphs.
Contribution
It provides a comprehensive evaluation of optimal algorithms and representations for safe flow decomposition, introducing heuristics to improve practical efficiency and scalability.
Findings
Optimal algorithms improve time by up to 70% and memory by up to 85%.
Optimal representations significantly increase output size by up to 170%.
Heuristics further improve performance with 10% additional gains.
Abstract
In Bioinformatics, the applications of flow decomposition in directed acyclic graphs are highlighted in RNA Assembly problem. However, it admits multiple solutions where exactly one solution correctly represents the underlying transcripts. The problem was addressed by Safe and Complete framework~[RECOMB16], which reports all the parts of the solution that are present in every possible solution. Khan et al.~[RECOMB22] first studied flow decomposition in the safe and complete framework. Their algorithm showed superior performance () over the popular heuristic (greedy-width) on sufficiently complex graphs for a unified metric of precision and coverage (F-score). They presented the solution in multiple representations using simple but suboptimal algorithms, which were later optimized by Khan and Tomescu~[ESA22], who also presented an optimal representation. In this paper, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRNA and protein synthesis mechanisms
