Construction of Multiple Constrained DNA Codes
Siddhartha Siddhiprada Bhoi, Paramapalli Udaya, Abhay Kumar Singh

TL;DR
This paper develops new DNA code families that minimize secondary structures and homopolymer runs, improving stability and performance for DNA data storage and computing.
Contribution
It introduces families of DNA codes with limited secondary structures and homopolymer runs, mapping error-correcting codes over ield;11ield to DNA nucleotides, achieving higher rates.
Findings
DNA codes with stem length at most two
Homopolymer run length at most four
Rates up to 0.5765 times the original code rate
Abstract
DNA sequences are prone to creating secondary structures by folding back on themselves by non-specific hybridization among its nucleotides. The formation of secondary structures makes the sequences chemically inactive towards synthesis and sequencing processes. In this letter, our goal is to tackle the problems due to the creation of secondary structures in DNA sequences along with constraints such as not having a large homopolymer run length. In this paper, we have presented families of DNA codes with secondary structures of stem length at most two and homopolymer run length at most four. By mapping the error correcting codes over to DNA nucleotides, we obtained DNA codes with rates times the rate of corresponding code over , which include some new secondary structure free and better-performing codes for DNA based data storage and DNA computing purposes.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · Advanced biosensing and bioanalysis techniques · DNA and Nucleic Acid Chemistry
