Fast Average-Case Pattern Matching on Weighted Sequences
Carl Barton, Chang Liu, Solon P. Pissis

TL;DR
This paper introduces algorithms for weighted string matching that achieve average-case sublinear search times, significantly improving efficiency for uncertain sequences with specific weight ratios.
Contribution
It presents novel algorithms for weighted pattern matching with average-case sublinear search times under certain weight ratio conditions.
Findings
Algorithms achieve average-case search time o(n) and o(σn)
Work in linear preprocessing time and space
Applicable for specific weight ratio thresholds
Abstract
A weighted string over an alphabet of size is a string in which a set of letters may occur at each position with respective occurrence probabilities. Weighted strings, also known as position weight matrices or uncertain sequences, naturally arise in many contexts. In this article, we study the problem of weighted string matching with a special focus on average-case analysis. Given a weighted pattern string of length , a text string of length , and a cumulative weight threshold , defined as the minimal probability of occurrence of factors in a weighted string, we present an algorithm requiring average-case search time for pattern matching for weight ratio . For a pattern string of length , a weighted text string of length , and a cumulative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
