High Information Density and Low Coverage Data Storage in DNA with Efficient Channel Coding Schemes
Yi Ding, Xuan He, Tuan Thanh Nguyen, Wentu Song, Zohar Yakhini, Eitan, Yaakobi, Linqiang Pan, Xiaohu Tang, Kui Cai

TL;DR
This paper presents a DNA data storage system using advanced error-correcting and constrained coding schemes, achieving high information density and reliable data recovery at low coverage levels through large-scale experiments.
Contribution
It introduces an efficient DNA storage architecture with novel coding schemes validated by large-scale experiments, reaching record-high information density and minimal coverage.
Findings
Achieved 1.731 and 1.815 bits per nucleotide density.
Successfully recovered data without errors at low coverage levels of 4.5 and 6.0.
Set new benchmarks for data density and coverage in DNA storage experiments.
Abstract
DNA-based data storage has been attracting significant attention due to its extremely high data storage density, low power consumption, and long duration compared to conventional data storage media. Despite the recent advancements in DNA data storage technology, significant challenges remain. In particular, various types of errors can occur during the processes of DNA synthesis, storage, and sequencing, including substitution errors, insertion errors, and deletion errors. Furthermore, the entire oligo may be lost. In this work, we report a DNA-based data storage architecture that incorporates efficient channel coding schemes, including different types of error-correcting codes (ECCs) and constrained codes, for both the inner coding and outer coding for the DNA data storage channel. We also carried out large scale experiments to validate our proposed DNA-based data storage architecture.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · Quantum Computing Algorithms and Architecture · Advanced biosensing and bioanalysis techniques
