Near-Optimal Average-Case Approximate Trace Reconstruction from Few Traces
Xi Chen, Anindya De, Chin Ho Lee, Rocco A. Servedio, Sandip Sinha

TL;DR
This paper introduces an efficient algorithm for approximate trace reconstruction of random binary strings from few traces, achieving near-optimal accuracy with a theoretical lower bound confirming the difficulty of the problem.
Contribution
It presents the first polynomial-time algorithm for approximate trace reconstruction with near-optimal error bounds for random strings, along with matching lower bounds.
Findings
Algorithm reconstructs with edit distance at most n*(δM)^{Ω(M)}
Lower bound shows minimal achievable expected edit distance is n*(δM)^{O(M)}
Works for any deletion rate 0<δ<1 with few traces
Abstract
In the standard trace reconstruction problem, the goal is to \emph{exactly} reconstruct an unknown source string from independent "traces", which are copies of that have been corrupted by a -deletion channel which independently deletes each bit of with probability and concatenates the surviving bits. We study the \emph{approximate} trace reconstruction problem, in which the goal is only to obtain a high-accuracy approximation of rather than an exact reconstruction. We give an efficient algorithm, and a near-matching lower bound, for approximate reconstruction of a random source string from few traces. Our main algorithmic result is a polynomial-time algorithm with the following property: for any deletion rate (which may depend on ), for almost every source…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMolecular Biology Techniques and Applications · Environmental DNA in Biodiversity Studies · Digital and Traditional Archives Management
