An End-to-End Coding Scheme for DNA-Based Data Storage With Nanopore-Sequenced Reads
Lorenz Welter, Roman Sokolovskii, Thomas Heinis, Antonia Wachter-Zeh,, Eirik Rosnes, Alexandre Graell i Amat

TL;DR
This paper introduces an end-to-end coding scheme for DNA data storage that effectively handles nanopore sequencing errors, achieving high performance through tailored channel modeling and soft decoding techniques.
Contribution
It proposes a novel concatenated coding scheme with a low-complexity decoder specifically designed for strand-dependent IDS errors in nanopore sequencing.
Findings
Achieves lower error rates compared to existing methods.
Demonstrates effective handling of strand-dependent errors.
Optimizes read/write cost trade-offs for DNA storage.
Abstract
We consider error-correcting coding for deoxyribonucleic acid (DNA)-based storage using nanopore sequencing. We model the DNA storage channel as a sampling noise channel where the input data is chunked into short DNA strands, which are copied a random number of times, and the channel outputs a random selection of noisy DNA strands. The retrieved DNA reads are prone to strand-dependent insertion, deletion, and substitution (IDS) errors. We construct an index-based concatenated coding scheme consisting of the concatenation of an outer code, an index code, and an inner code. We further propose a low-complexity (linear in ) maximum a posteriori probability decoder that takes into account the strand-dependent IDS errors and the randomness of the drawing to infer symbolwise a posteriori probabilities for the outer decoder. We present Monte-Carlo simulations for information-outage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · Advanced biosensing and bioanalysis techniques · Quantum-Dot Cellular Automata
