Codes with Biochemical Constraints and Single Error Correction for DNA-Based Data Storage
Shu Liu, Chaoping Xing, Yaqian Zhang

TL;DR
This paper develops DNA codes that incorporate biochemical constraints and error correction to improve data storage reliability, achieving higher information rates and addressing secondary structure avoidance, homopolymer limits, and GC-balance.
Contribution
It introduces new DNA code constructions that satisfy multiple biochemical constraints and include single error correction, surpassing previous codes in rate and robustness.
Findings
Constructed DNA codes with secondary structure avoidance and homopolymer limits.
Achieved higher information rates, e.g., 1.3206 bits/nt for specific parameters.
Presented codes with GC-locally balanced constraints.
Abstract
In DNA-based data storage, DNA codes with biochemical constraints and error correction are designed to protect data reliability. Single-stranded DNA sequences with secondary structure avoidance (SSA) help to avoid undesirable secondary structures which may cause chemical inactivity. Homopolymer run-length limit and GC-balanced limit also help to reduce the error probability of DNA sequences during synthesizing and sequencing. In this letter, based on a recent work \cite{bib7}, we construct DNA codes free of secondary structures of stem length and have homopolymer run-length for odd and with rate , where is in Table \ref{tm}. In particular, when , , its rate tends to 1.3206 bits/nt, beating a previous work by Benerjee {\it et al.}. We also construct DNA codes with all of the above three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · Advanced biosensing and bioanalysis techniques · Advanced Data Storage Technologies
