The complexity of string partitioning
Anne Condon, J\'an Ma\v{n}uch, Chris Thachuk

TL;DR
This paper proves that the string partition problem, which involves dividing strings into segments without collisions, is NP-complete even under restrictive conditions, highlighting its computational difficulty relevant to synthetic biology.
Contribution
The paper establishes the NP-completeness of the string partition problem for various collision definitions and restrictions, including binary alphabets, advancing understanding of its computational complexity.
Findings
String partition problem is NP-complete under multiple conditions.
Hardness persists even with binary alphabet restrictions.
Implications for oligo design in gene synthesis.
Abstract
Given a string over a finite alphabet and an integer , can be partitioned into strings of length at most , such that there are no \emph{collisions}? We refer to this question as the \emph{string partition} problem and show it is \NP-complete for various definitions of collision and for a number of interesting restrictions including . This establishes the hardness of an important problem in contemporary synthetic biology, namely, oligo design for gene synthesis.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRNA and protein synthesis mechanisms · Genomics and Phylogenetic Studies · Algorithms and Data Compression
