Quasi-Linear-Time Algorithm for Longest Common Circular Factor

Mai Alzamel; Maxime Crochemore; Costas S. Iliopoulos; Tomasz Kociumaka; Jakub Radoszewski; Wojciech Rytter; Juliusz Straszyński; Tomasz Waleń; Wiktor Zuba

arXiv:1901.11305 · cs.DS · February 1, 2019

TL;DR

This paper presents an algorithm for the Longest Common Circular Factor (LCCF) problem: given two strings of length $n$, find the longest factor of one string that has a cyclic shift occurring as a factor of the other. The problem extends the classic Longest Common Factor problem, and the algorithm runs in $O(n \log^5 n)$ time.

Contribution

The paper introduces the LCCF problem as a new string similarity measure that extends the classic Longest Common Factor, and provides the first quasi-linear-time algorithm to solve it.

Findings

01 LCCF can be computed in $O(n \log^5 n)$ time.

02 The algorithm extends classic string matching techniques.

03 LCCF serves as a new similarity measure for strings.

Abstract

We introduce the Longest Common Circular Factor (LCCF) problem in which, given strings $S$ and $T$ of length $n$, we are to compute the longest factor of $S$ whose cyclic shift occurs as a factor of $T$. It is a new similarity measure, an extension of the classic Longest Common Factor. We show how to solve the LCCF problem in $O(n \log^5 n)$ time.
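To make the problem statement concrete, here is a naive reference sketch of LCCF in Python. It is not the paper's $O(n \log^5 n)$ algorithm; it is a brute-force baseline (roughly cubic time or worse) that only illustrates the definition. The function names `least_rotation` and `lccf_naive` are hypothetical and do not come from the paper. The idea is that two strings are cyclic shifts of each other exactly when their lexicographically smallest rotations coincide, so each factor can be compared through that canonical form.

```python
from typing import Dict, Tuple


def least_rotation(s: str) -> str:
    """Return the lexicographically smallest cyclic shift of s (naive, quadratic)."""
    if not s:
        return s
    return min(s[i:] + s[:i] for i in range(len(s)))


def lccf_naive(s: str, t: str) -> Tuple[int, int, int]:
    """Brute-force LCCF baseline (illustrative only, not the paper's method).

    Returns (length, i, j) such that s[i:i+length] is a cyclic shift of
    t[j:j+length] and length is as large as possible; (0, 0, 0) if the
    strings share no letter.
    """
    # Try candidate lengths from longest to shortest and stop at the first hit.
    for length in range(min(len(s), len(t)), 0, -1):
        # Canonical rotation of every length-`length` factor of t,
        # mapped to the first position where it occurs.
        seen: Dict[str, int] = {}
        for j in range(len(t) - length + 1):
            seen.setdefault(least_rotation(t[j:j + length]), j)
        # A factor of s matches if its canonical rotation was seen in t.
        for i in range(len(s) - length + 1):
            key = least_rotation(s[i:i + length])
            if key in seen:
                return length, i, seen[key]
    return 0, 0, 0


print(lccf_naive("abcde", "xcdab"))  # → (4, 0, 1)
```

In the usage line, the factor `abcd` of `abcde` is a cyclic shift of `cdab`, which occurs in `xcdab` at position 1, so the answer has length 4, whereas the longest common factor of the same two strings without cyclic shifts has length 2.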

Equations

$\Gamma_{y,z}$ and $\Delta_{y,z}$ and $\Gamma_{t,x}$ and $\Delta_{t,x}$.

$\mathrm{LeftWin}_k(W, i)$

$\mathrm{RightWin}_k(W, i)$

$\mathrm{LeftSync}_k(W, i)$

$\mathrm{RightSync}_k(W, i)$

$\Psi_W(\alpha, i, \beta) = (W[\mathit{first}(\alpha) \mathinner{..} i),\ W[i \mathinner{..} \mathit{first}(\beta)))$

$\mathrm{CAND}_{a,b}(W) = \{\Psi_W(\alpha, i, \beta) : \alpha \in \mathrm{LeftSync}_a(W, i),\ \beta \in \mathrm{RightSync}'_b(W, i),\ i \in [1 \mathinner{..} |W|]\}$.

$\sum_i |\mathrm{LeftSync}_k(W, i)| = O(n), \quad \sum_i |\mathrm{RightSync}_k(W, i)| = O(n)$.

$\mathrm{CAND}_{a,b}(W) = \{\Psi_W(x, i, y) : x \in \mathrm{Lyn}(\mathrm{LeftWin}_a(W, i)),\ y \in \mathrm{Lyn}'(\mathrm{RightWin}_b(W, i)),\ i \in [1 \mathinner{..} |W|]\}$.

$|X| - |U| < |\lambda|$ or $|Y| - |U| < |\lambda|$.

$\{\Psi_W(\alpha, i, x) : \alpha \in \mathrm{LeftSync}_a(W, i),\ x \in \mathrm{Lyn}'(\mathrm{RightWin}_b(i)),\ i \in [1 \mathinner{..} |W|]\} \cup \{\Psi_W(x, i, \alpha) : x \in \mathrm{Lyn}(\mathrm{LeftWin}_a(i)),\ \alpha \in \mathrm{RightSync}'_b(W, i),\ i \in [1 \mathinner{..} |W|]\}$

$\mathrm{RECT}(x, y)$

$\mathrm{RECT}'(z, t)$

Full text

Quasi-Linear-Time Algorithm for Longest Common Circular Factor

Mai Alzamel

Department of Informatics, King’s College London, London, UK

[mai.alzamel,maxime.crochemore,costas.iliopoulos]@kcl.ac.uk

Maxime Crochemore

Department of Informatics, King’s College London, London, UK

[mai.alzamel,maxime.crochemore,costas.iliopoulos]@kcl.ac.uk

Costas S. Iliopoulos

Department of Informatics, King’s College London, London, UK

[mai.alzamel,maxime.crochemore,costas.iliopoulos]@kcl.ac.uk

Tomasz Kociumaka