Codes with Biochemical Constraints and Single Error Correction for   DNA-Based Data Storage

Shu Liu; Chaoping Xing; Yaqian Zhang

arXiv:2307.00221·cs.IT·July 4, 2023·2 cites

Codes with Biochemical Constraints and Single Error Correction for DNA-Based Data Storage

Shu Liu, Chaoping Xing, Yaqian Zhang

PDF

Open Access

TL;DR

This paper develops DNA codes that incorporate biochemical constraints and error correction to improve data storage reliability, achieving higher information rates and addressing secondary structure avoidance, homopolymer limits, and GC-balance.

Contribution

It introduces new DNA code constructions that satisfy multiple biochemical constraints and include single error correction, surpassing previous codes in rate and robustness.

Findings

01

Constructed DNA codes with secondary structure avoidance and homopolymer limits.

02

Achieved higher information rates, e.g., 1.3206 bits/nt for specific parameters.

03

Presented codes with GC-locally balanced constraints.

Abstract

In DNA-based data storage, DNA codes with biochemical constraints and error correction are designed to protect data reliability. Single-stranded DNA sequences with secondary structure avoidance (SSA) help to avoid undesirable secondary structures which may cause chemical inactivity. Homopolymer run-length limit and GC-balanced limit also help to reduce the error probability of DNA sequences during synthesizing and sequencing. In this letter, based on a recent work \cite{bib7}, we construct DNA codes free of secondary structures of stem length $\geq m$ and have homopolymer run-length $\leq ℓ$ for odd $m \leq 11$ and $ℓ \geq 3$ with rate $1 + lo g_{2} ρ_{m} - 3/ (2^{ℓ - 1} + ℓ + 1)$ , where $ρ_{m}$ is in Table \ref{tm}. In particular, when $m = 3$ , $ℓ = 4$ , its rate tends to 1.3206 bits/nt, beating a previous work by Benerjee {\it et al.}. We also construct DNA codes with all of the above three…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDNA and Biological Computing · Advanced biosensing and bioanalysis techniques · Advanced Data Storage Technologies