A Generalized Trace Reconstruction Problem: Recovering a String of Probabilities
Joey Rivkin, Gregory Valiant, Paul Valiant

TL;DR
This paper generalizes the trace reconstruction problem to probabilistic strings with deletion, establishing bounds for worst-case scenarios and demonstrating efficient recovery for random strings under certain conditions.
Contribution
It introduces a new probabilistic trace reconstruction model and provides both lower bounds for worst-case strings and efficient algorithms for random strings.
Findings
No algorithms can approximate worst-case strings with fewer than 2^{Ω(√n)} traces for certain deletion probabilities.
Random strings can be reconstructed efficiently with polynomially many traces when deletion probability is small.
Fourier analysis is used to establish indistinguishability and lower bounds in the model.
Abstract
We introduce the following natural generalization of trace reconstruction, parameterized by a deletion probability and length : There is a length string of probabilities, and each "trace" is obtained by 1) sampling a length binary string whose th coordinate is independently set to 1 with probability and 0 otherwise, and then 2) deleting each of the binary values independently with probability , and returning the corresponding binary string of length . The goal is to recover an estimate of from a set of independently drawn traces. In the case that all this is the standard trace reconstruction problem. We show two complementary results. First, for worst-case strings and any deletion probability at least order , no algorithm can approximate to constant distance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms
