On Hardness of Jumbled Indexing

Amihood Amir; Timothy Chan; Moshe Lewenstein; Noa Lewenstein

arXiv:1405.0189·cs.DS·May 2, 2014

TL;DR

This paper shows that, under a 3SUM-hardness assumption, jumbled indexing cannot have both truly subquadratic preprocessing time and truly sublinear query time for alphabets of super-constant size, which explains why the two naive algorithms have been so hard to improve.

Contribution

The paper establishes conditional lower bounds, based on 3SUM-hardness assumptions, on the trade-off between preprocessing time and query time for jumbled indexing.
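For context, 3SUM is commonly stated as: given $n$ integers, decide whether three of them sum to zero. The hardness assumption is that no algorithm solves it in $O(n^{2 - \epsilon})$ time for a constant $\epsilon > 0$. The following is a minimal sketch of the classic quadratic sort-and-two-pointer approach, included only to illustrate the problem; the function name `has_3sum` is hypothetical, and the exact 3SUM variant used in the paper may differ.

```python
def has_3sum(nums):
    """Return True if entries at three distinct positions sum to zero.

    Illustrative sketch (not from the paper): sort, then for each
    smallest element scan the rest with two pointers. Quadratic time.
    """
    values = sorted(nums)
    n = len(values)
    for i in range(n - 2):
        lo, hi = i + 1, n - 1
        while lo < hi:
            total = values[i] + values[lo] + values[hi]
            if total == 0:
                return True
            if total < 0:
                lo += 1  # sum too small: move the lower pointer up
            else:
                hi -= 1  # sum too large: move the upper pointer down
    return False
```

For example, `has_3sum([-5, 1, 4, 10])` returns `True` because $-5 + 1 + 4 = 0$.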

Findings

01

Under a 3SUM-hardness assumption, jumbled indexing for alphabets of size $\omega(1)$ requires $\Omega(n^{2 - \epsilon})$ preprocessing time or $\Omega(n^{1 - \delta})$ query time.

02

For fixed small alphabets, similar lower bounds apply, indicating fundamental complexity barriers.

03

The results give theoretical evidence for why the running times of the naive jumbled indexing algorithms have seen little improvement.

Abstract

Jumbled indexing is the problem of indexing a text $T$ for queries that ask whether there is a substring of $T$ matching a pattern represented as a Parikh vector, i.e., the vector of frequency counts for each character. Jumbled indexing has garnered a lot of interest in the last four years. There is a naive algorithm that preprocesses all answers in $O(n^{2} |\Sigma|)$ time, allowing quick queries afterwards, and there is another naive algorithm that requires no preprocessing but has $O(n \log |\Sigma|)$ query time. Despite a tremendous amount of effort there has been little improvement over these running times. In this paper we provide good reason for this. We show that, under a 3SUM-hardness assumption, jumbled indexing for alphabets of size $\omega(1)$ requires $\Omega(n^{2 - \epsilon})$ preprocessing time or $\Omega(n^{1 - \delta})$ query time for any $\epsilon, \delta > 0$. In fact, under a…
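The two naive approaches mentioned in the abstract can be sketched as follows. This is an illustrative Python sketch under stated assumptions, not the paper's code: the function names (`preprocess_all`, `query_index`, `query_scan`) are hypothetical, patterns are assumed to be non-empty and given as `{character: count}` mappings, and the dictionary-based bookkeeping does not reproduce the exact running times quoted above.

```python
from collections import Counter


def preprocess_all(text):
    """Naive index: record the Parikh vector of every substring of text.

    Extends each starting position one character at a time, so it
    visits quadratically many substrings.
    """
    index = set()
    for i in range(len(text)):
        counts = Counter()
        for j in range(i, len(text)):
            counts[text[j]] += 1
            index.add(frozenset(counts.items()))
    return index


def query_index(index, pattern_counts):
    """Answer a query by lookup in the precomputed index."""
    cleaned = {c: k for c, k in pattern_counts.items() if k > 0}
    return frozenset(cleaned.items()) in index


def query_scan(text, pattern_counts):
    """Answer a query with no preprocessing by sliding a window."""
    target = {c: k for c, k in pattern_counts.items() if k > 0}
    m = sum(target.values())  # every match has exactly this length
    if m == 0 or m > len(text):
        return False
    window = dict(Counter(text[:m]))
    if window == target:
        return True
    for i in range(m, len(text)):
        incoming, outgoing = text[i], text[i - m]
        window[incoming] = window.get(incoming, 0) + 1
        window[outgoing] -= 1
        if window[outgoing] == 0:
            del window[outgoing]  # keep zero counts out of the comparison
        if window == target:
            return True
    return False
```

For the text `"abbaac"`, the pattern `{"a": 2, "b": 1}` is matched by the substring `"baa"`, so both `query_index(preprocess_all("abbaac"), {"a": 2, "b": 1})` and `query_scan("abbaac", {"a": 2, "b": 1})` return `True`, while the pattern `{"b": 2, "c": 1}` has no matching substring and both return `False`.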

Taxonomy

Topics: Algorithms and Data Compression · Music and Audio Processing · Machine Learning and Algorithms