Insertion and Deletion Correction in Polymer-based Data Storage
Anisha Banerjee, Antonia Wachter-Zeh, Eitan Yaakobi

TL;DR
This paper explores error correction in polymer-based data storage, focusing on insertion and deletion errors in composition multisets, and proposes new coding constraints to improve robustness.
Contribution
It generalizes existing error models by including insertions and deletions, and develops new coding constraints for improved error correction in polymer storage.
Findings
Analysis of the robustness of existing reconstruction codebooks to insertion and deletion errors.
Proposal of new coding constraints to correct insertion and deletion errors in composition multisets.
Enhanced understanding of error models in polymer-based data storage systems.
Abstract
Synthetic polymer-based storage seems to be a particularly promising candidate that could help to cope with the ever-increasing demand for archival storage requirements. It involves designing molecules of distinct masses to represent the respective bits , followed by the synthesis of a polymer of molecular units that reflects the order of bits in the information string. Reading out the stored data requires the use of a tandem mass spectrometer, that fragments the polymer into shorter substrings and provides their corresponding masses, from which the \emph{composition}, i.e. the number of s and s in the concerned substring can be inferred. Prior works have dealt with the problem of unique string reconstruction from the set of all possible compositions, called \emph{composition multiset}. This was accomplished either by determining which string lengths always allow unique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
