Efficient Sparse Flow Decomposition Methods for RNA Multi-Assembly
Mathieu Besan\c{c}on

TL;DR
This paper introduces a novel, efficient method for Sparse Flow Decomposition in DAGs, crucial for RNA transcript assembly, using a conic hull formulation and Frank-Wolfe algorithm, outperforming existing integer optimization techniques.
Contribution
Proposes a new conic hull-based formulation of SFD and applies Frank-Wolfe algorithms for faster, parsimonious decompositions in RNA multi-assembly.
Findings
Outperforms recent integer optimization methods in runtime.
Achieves competitive transcript reconstruction accuracy.
Produces sparse decompositions without explicitly minimizing cardinality.
Abstract
Decomposing a flow on a Directed Acyclic Graph (DAG) into a weighted sum of a small number of paths is an essential task in operations research and bioinformatics. This problem, referred to as Sparse Flow Decomposition (SFD), has gained significant interest, in particular for its application in RNA transcript multi-assembly, the identification of the multiple transcripts corresponding to a given gene and their relative abundance. Several recent approaches cast SFD variants as integer optimization problems, motivated by the NP-hardness of the formulations they consider. We propose an alternative formulation of SFD as a data fitting problem on the conic hull of the flow polytope. By reformulating the problem on the flow polytope for compactness and solving it using specific variants of the Frank-Wolfe algorithm, we obtain a method converging rapidly to the minimizer of the chosen loss…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModular Robots and Swarm Intelligence · Fuel Cells and Related Materials · Innovative Microfluidic and Catalytic Techniques Innovation
