Complex DNA Synthesis Sequences
Boaz Moav, Ryan Gabrys, Eitan Yaakobi

TL;DR
This paper introduces a hybrid DNA synthesis framework that generalizes existing methods, analyzes its information capacity, and develops algorithms for optimal synthesis sequences, advancing the theoretical understanding of constrained DNA storage.
Contribution
It presents a unified synthesis model, derives maximal information rates, and designs algorithms for optimal sequences, bridging theoretical gaps and extending previous models.
Findings
Derived tight expressions for maximal information rate.
Designed dynamic programming algorithms for optimal synthesis sequences.
Extended synthesis models to capture new structural limitations.
Abstract
DNA-based storage offers unprecedented density and durability, but its scalability is fundamentally limited by the efficiency of parallel strand synthesis. Existing methods either allow unconstrained nucleotide additions to individual strands, such as enzymatic synthesis, or enforce identical additions across many strands, such as photolithographic synthesis. We introduce and analyze a hybrid synthesis framework that generalizes both approaches: in each cycle, a nucleotide is selected from a restricted subset and incorporated in parallel. This model gives rise to a new notion of a complex synthesis sequence. Building on this framework, we extend the information rate definition of Lenz et al. and analyze an analog of the deletion ball, defined and studied in this setting, deriving tight expressions for the maximal information rate and its asymptotic behavior. These results bridge the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
