Cartesian Tree Subsequence Matching
Tsubasa Oizumi, Takeshi Kai, Takuya Mieno, Shunsuke Inenaga, Hiroki, Arimura

TL;DR
This paper introduces the Cartesian tree subsequence matching problem, providing an efficient algorithm with near-linear time complexity, contrasting it with the NP-hard order-preserving subsequence matching.
Contribution
The paper defines the CTMSeq problem and develops an efficient algorithm with $O(mn \, \log \log n)$ time complexity using dynamic predecessor data structures.
Findings
Efficient $O(mn \log \log n)$-time algorithm for CTMSeq.
Contrast with NP-hard order-preserving subsequence matching.
Provides a practical solution for Cartesian-tree based subsequence matching.
Abstract
Park et al. [TCS 2020] observed that the similarity between two (numerical) strings can be captured by the Cartesian trees: The Cartesian tree of a string is a binary tree recursively constructed by picking up the smallest value of the string as the root of the tree. Two strings of equal length are said to Cartesian-tree match if their Cartesian trees are isomorphic. Park et al. [TCS 2020] introduced the following Cartesian tree substring matching (CTMStr) problem: Given a text string of length and a pattern string of length , find every consecutive substring of a text string such that and Cartesian-tree match. They showed how to solve this problem in time. In this paper, we introduce the Cartesian tree subsequence matching (CTMSeq) problem, that asks to find every minimal substring of such that contains a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Network Packet Processing and Optimization · Natural Language Processing Techniques
