Construction of Multiple Constrained DNA Codes

Siddhartha Siddhiprada Bhoi; Paramapalli Udaya; Abhay Kumar Singh

arXiv:2211.16096·cs.IT·November 30, 2022

Construction of Multiple Constrained DNA Codes

Siddhartha Siddhiprada Bhoi, Paramapalli Udaya, Abhay Kumar Singh

PDF

Open Access

TL;DR

This paper develops new DNA code families that minimize secondary structures and homopolymer runs, improving stability and performance for DNA data storage and computing.

Contribution

It introduces families of DNA codes with limited secondary structures and homopolymer runs, mapping error-correcting codes over ield;11ield to DNA nucleotides, achieving higher rates.

Findings

01

DNA codes with stem length at most two

02

Homopolymer run length at most four

03

Rates up to 0.5765 times the original code rate

Abstract

DNA sequences are prone to creating secondary structures by folding back on themselves by non-specific hybridization among its nucleotides. The formation of secondary structures makes the sequences chemically inactive towards synthesis and sequencing processes. In this letter, our goal is to tackle the problems due to the creation of secondary structures in DNA sequences along with constraints such as not having a large homopolymer run length. In this paper, we have presented families of DNA codes with secondary structures of stem length at most two and homopolymer run length at most four. By mapping the error correcting codes over $Z_{11}$ to DNA nucleotides, we obtained DNA codes with rates $0.5765$ times the rate of corresponding code over $Z_{11}$ , which include some new secondary structure free and better-performing codes for DNA based data storage and DNA computing purposes.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDNA and Biological Computing · Advanced biosensing and bioanalysis techniques · DNA and Nucleic Acid Chemistry