Deterministic Polynomial-Time Algorithms for Designing Short DNA Words
Ming-Yang Kao, Henry C. M. Leung, He Sun, Yong Zhang

TL;DR
This paper introduces deterministic polynomial-time algorithms for designing short DNA words that meet specific constraints, improving upon previous randomized methods by producing shorter, guaranteed solutions for applications in DNA nanotechnology.
Contribution
The paper presents the first deterministic polynomial-time algorithms for DNA word design, achieving shorter word lengths and handling more constraints than prior randomized approaches.
Findings
Constructed DNA words of length approximately 2.1 log n + 6.28 k
Algorithms guarantee constraint satisfaction with polynomial runtime
Extended algorithms to handle additional constraints beyond previous methods
Abstract
Designing short DNA words is a problem of constructing a set (i.e., code) of n DNA strings (i.e., words) with the minimum length such that the Hamming distance between each pair of words is at least k and the n words satisfy a set of additional constraints. This problem has applications in, e.g., DNA self-assembly and DNA arrays. Previous works include those that extended results from coding theory to obtain bounds on code and word sizes for biologically motivated constraints and those that applied heuristic local searches, genetic algorithms, and randomized algorithms. In particular, Kao, Sanghi, and Schweller (2009) developed polynomial-time randomized algorithms to construct n DNA words of length within a multiplicative constant of the smallest possible word length (e.g., 9 max{log n, k}) that satisfy various sets of constraints with high probability. In this paper, we give…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · Advanced biosensing and bioanalysis techniques · semigroups and automata theory
