SPIDER-WEB generates coding algorithms with superior error tolerance and real-time information retrieval capacity
Haoling Zhang, Zhaojun Lan, Wenwei Zhang, Xun Xu, Zhi Ping, Yiwei, Zhang, Yue Shen

TL;DR
SPIDER-WEB is a graph-based architecture that automatically generates coding algorithms for DNA data storage, offering high error correction, real-time retrieval, and scalability for large datasets.
Contribution
It introduces an all-in-one, automatic algorithm generation method for DNA data storage, improving error correction and retrieval speed over existing approaches.
Findings
Corrects up to 4% edit errors with 5.5% redundancy
Achieves real-time retrieval 305 times faster than single-molecule sequencing
Scalable to exabyte-level data storage
Abstract
DNA has been considered a promising medium for storing digital information. As an essential step in the DNA-based data storage workflow, coding algorithms are responsible to implement functions including bit-to-base transcoding, error correction, etc. In previous studies, these functions are normally realized by introducing multiple algorithms. Here, we report a graph-based architecture, named SPIDER-WEB, providing an all-in-one coding solution by generating customized algorithms automatically. SPIDERWEB is able to correct a maximum of 4% edit errors in the DNA sequences including substitution and insertion/deletion (indel), with only 5.5% redundant symbols. Since no DNA sequence pretreatment is required for the correcting and decoding processes, SPIDER-WEB offers the function of real-time information retrieval, which is 305.08 times faster than the speed of single-molecule sequencing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · Algorithms and Data Compression · Advanced Data Storage Technologies
