# Reconstruction and Error-Correction Codes for Polymer-Based Data Storage

**Authors:** Srilakshmi Pattabiraman, Ryan Gabrys, and Olgica Milenkovic

arXiv: 1904.09280 · 2019-04-22

## TL;DR

This paper introduces a novel coding scheme for polymer-based data storage that enables unique string reconstruction and correction of single mass errors using minimal redundancy and efficient decoding algorithms.

## Contribution

It presents a new family of codes that combine combinatorial path interleaving with reflection techniques to achieve error correction and unique reconstruction with logarithmic redundancy.

## Key findings

- Codes allow unique reconstruction from erroneous substring multisets.
- Single mass error correction is achieved with minimal additional bits.
- Asymptotic code rate approaches one, with efficient decoding.

## Abstract

Motivated by polymer-based data-storage platforms that use chains of binary synthetic polymers as the recording media and read the content via tandem mass spectrometers, we propose a new family of codes that allows for unique string reconstruction and correction of one mass error. Our approach is based on introducing redundancy that scales logarithmically with the length of the string and allows for the string to be uniquely reconstructed based only on its erroneous substring composition multiset. The key idea behind our unique reconstruction approach is to interleave Catalan-type paths with arbitrary binary strings and "reflect" them so as to allow prefixes and suffixes of the same length to have different weights. For error correction, we add a constant number of bits that provides information about the weights of reflected pairs of bits and hence enable recovery from a single mass error. The asymptotic code rate of the scheme is one, and decoding is accomplished via a simplified version of the backtracking algorithm used for the Turnpike problem.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.09280/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1904.09280/full.md

## References

10 references — full list in the complete paper: https://tomesphere.com/paper/1904.09280/full.md

---
Source: https://tomesphere.com/paper/1904.09280