Block Length Gain for Nanopore Channels
Yu-Ting Lin, Hsin-Po Wang, Venkatesan Guruswami

TL;DR
This paper extends Geno-Weaving, a coding strategy for DNA data storage, to effectively handle deletion errors, demonstrating that it overcomes finite-length penalties and performs well under realistic deletion rates.
Contribution
The paper introduces an extension of Geno-Weaving that combats deletion errors and shows it eliminates finite-length penalties, improving DNA storage reliability.
Findings
Geno-Weaving effectively handles deletion errors.
Finite-length penalty vanishes with large number of strands.
Works well at realistic deletion rates of 0.1%–10%.
Abstract
DNA is an attractive candidate for data storage. Its millennial durability and nanometer scale offer exceptional data density and longevity. Its relevance to medical applications also drives advances in DNA-related biotechnology. To protect our data against errors, a straightforward approach uses one error-correcting code per DNA strand, with a Reed--Solomon code protecting the collection of strands. A downside is that current technology can only synthesize strands 200--300 nucleotides long. At this block length, the inner code rate suffers a significant finite-length penalty, making its effective capacity hard to characterize. Last year, we proposed in a JSAIT publication. The idea is to protect the same position across multiple strands using one code; this provably achieves capacity against substitution errors. In this paper, we extend the idea to combat…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · Advanced biosensing and bioanalysis techniques · Quantum Computing Algorithms and Architecture
