Approximate Trace Reconstruction via Median String (in Average-Case)
Diptarka Chakraborty, Debarati Das, and Robert Krauthgamer

TL;DR
This paper introduces a near-linear time algorithm for approximate trace reconstruction in the average-case, using only three traces to recover a string within a small edit distance, significantly improving efficiency and accuracy.
Contribution
It presents the first near-linear time algorithm for approximate trace reconstruction with three traces in the average-case, utilizing median string computation.
Findings
Algorithm runs in near-linear time $ ilde O(n)$
Reports a string within $O( ilde{p} n)$ edit distance from the original
Uses only three traces for reconstruction
Abstract
We consider an \emph{approximate} version of the trace reconstruction problem, where the goal is to recover an unknown string from traces (each trace is generated independently by passing through a probabilistic insertion-deletion channel with rate ). We present a deterministic near-linear time algorithm for the average-case model, where is random, that uses only \emph{three} traces. It runs in near-linear time and with high probability reports a string within edit distance from for , which significantly improves over the straightforward bound of . Technically, our algorithm computes a -approximate median of the three input traces. To prove its correctness, our probabilistic analysis shows that an approximate median is indeed close to the unknown . To achieve a near-linear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
