Optimal mean-based algorithms for trace reconstruction

Anindya De; Ryan O'Donnell; Rocco Servedio

arXiv:1612.03148·cs.CC·December 12, 2016·2 cites

TL;DR

This paper establishes tight bounds for mean-based algorithms in trace reconstruction, showing that they require time and a number of samples exponential in $n^{1/3}$.

Contribution

It provides matching upper and lower bounds for mean-based trace reconstruction, extending results to various deletion probabilities and incorporating insertions and bit-flips.

Findings

01

Mean-based algorithms need exponential time and samples, of order $\exp(n^{1/3})$

02

Matching bounds are proven for deletion probabilities $\delta$ in different regimes

03

Insertions and bit-flips can be handled, with insertions aiding reconstruction when $\delta > 1/2$

Abstract

In the (deletion-channel) trace reconstruction problem, there is an unknown $n$-bit source string $x$. An algorithm is given access to independent traces of $x$, where a trace is formed by deleting each bit of $x$ independently with probability $\delta$. The goal of the algorithm is to recover $x$ exactly (with high probability), while minimizing samples (number of traces) and running time. Previously, the best known algorithm for the trace reconstruction problem was due to Holenstein et al.; it uses $\exp(\tilde{O}(n^{1/2}))$ samples and running time for any fixed $0 < \delta < 1$. It is also what we call a "mean-based algorithm", meaning that it only uses the empirical means of the individual bits of the traces. Holenstein et al. also gave a lower bound, showing that any mean-based algorithm must use at least $n^{\tilde{\Omega}(\log n)}$ samples. In this paper we improve both of…
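
To make the abstract's terminology concrete, the following is a minimal Python sketch of the deletion channel and of the statistic a mean-based algorithm is allowed to see: the empirical mean of each bit position across traces. It is an illustration only, not the paper's reconstruction algorithm. The function names `deletion_trace` and `mean_trace`, the example string, and the choice to pad short traces with zeros up to length $n$ are assumptions made for this sketch.

```python
import random


def deletion_trace(x, delta, rng):
    """Return one trace: each bit of x is deleted independently with probability delta."""
    return [b for b in x if rng.random() >= delta]


def mean_trace(x, delta, num_traces, rng):
    """Empirical mean of each bit position over num_traces independent traces.

    Assumption for this sketch: a trace shorter than len(x) is treated as if
    padded with zeros, so every trace contributes to all len(x) positions.
    """
    n = len(x)
    sums = [0] * n
    for _ in range(num_traces):
        trace = deletion_trace(x, delta, rng)
        for j, bit in enumerate(trace):
            sums[j] += bit
    return [s / num_traces for s in sums]


rng = random.Random(0)           # fixed seed so runs are repeatable
x = [1, 0, 1, 1, 0, 0, 1, 0]     # hypothetical source string
means = mean_trace(x, delta=0.3, num_traces=10000, rng=rng)
print([round(m, 3) for m in means])
```

A mean-based algorithm, in the sense of the abstract, would base its guess for $x$ only on a vector like `means`, never on the individual traces. For this example, the first entry should concentrate near $(1-\delta)\sum_i \delta^{i} x_i \approx 0.782$ (indexing $x$ from $i = 0$), since the first trace bit equals $x_i$ exactly when bits $0, \dots, i-1$ are deleted and bit $i$ survives.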

Taxonomy

Topics: Algorithms and Data Compression · DNA and Biological Computing · Advanced Data Storage Technologies