New Algorithms for Position Heaps
Travis Gagie, Wing-Kai Hon, Tsung-Han Ku

TL;DR
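This paper presents four results on position heaps, a text index that serves as an alternative to suffix trees and suffix arrays: bounding the heap height when the maximum pattern length is bounded, linear-time construction independent of alphabet size, augmentation so that a heap supports access to the corresponding suffix array (and vice versa), and a variant that a compressed suffix array can simulate efficiently with a linear number of extra bits.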
Contribution
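The paper shows how to limit heap height by bounding the maximum pattern length, how to build a position heap in linear time independent of alphabet size, and how to augment a position heap so that it supports access to the corresponding suffix array, and vice versa.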
Findings
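Position heaps can be built in linear time, independent of alphabet size
Limiting the maximum pattern length also limits the heap height, reducing the worst-case cost of insertions and deletions
A position heap variant can be simulated efficiently by a compressed suffix array with a linear number of extra bits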
Abstract
We present several results about position heaps, a relatively new alternative to suffix trees and suffix arrays. First, we show that, if we limit the maximum length of patterns to be sought, then we can also limit the height of the heap and reduce the worst-case cost of insertions and deletions. Second, we show how to build a position heap in linear time independent of the size of the alphabet. Third, we show how to augment a position heap such that it supports access to the corresponding suffix array, and vice versa. Fourth, we introduce a variant of a position heap that can be simulated efficiently by a compressed suffix array with a linear number of extra bits.
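For readers unfamiliar with the structure, the following is a minimal sketch of a position heap, assuming the standard definition: suffixes are inserted from shortest to longest, and each insertion adds exactly one trie node labeled with the suffix's starting position. This is a naive construction whose cost depends on the heap height, not the linear-time algorithm described in the paper. The names `Node`, `build_position_heap`, and `find_occurrences` are illustrative and do not come from the paper.

```python
class Node:
    """One trie node; pos is the start of the suffix that created it."""
    def __init__(self, pos=None):
        self.pos = pos          # None only for the root
        self.children = {}      # character -> Node


def build_position_heap(text):
    """Naive construction: insert suffixes from shortest to longest."""
    root = Node()
    n = len(text)
    for start in range(n - 1, -1, -1):
        node, i = root, start
        # Follow the longest prefix of the suffix already present in the trie.
        while i < n and text[i] in node.children:
            node = node.children[text[i]]
            i += 1
        # The trie depth is always smaller than the suffix length here,
        # so at least one character remains to label the new edge.
        node.children[text[i]] = Node(start)
    return root


def find_occurrences(root, text, pattern):
    """Return the sorted start positions of pattern in text."""
    if not pattern:
        return list(range(len(text)))
    path, node = [], root
    for ch in pattern:
        node = node.children.get(ch)
        if node is None:
            break
        path.append(node)
    exhausted = len(path) == len(pattern)
    # Nodes whose path is shorter than the pattern are only candidates
    # and must be verified against the text.
    candidates = path[:-1] if exhausted else path
    hits = [v.pos for v in candidates if text.startswith(pattern, v.pos)]
    if exhausted:
        # Every node at or below the final node spells a suffix that
        # starts with the whole pattern, so no verification is needed.
        stack = [path[-1]]
        while stack:
            v = stack.pop()
            hits.append(v.pos)
            stack.extend(v.children.values())
    return sorted(hits)
```

In this sketch, the number of candidates checked during a search is bounded by the length of the root-to-node path, which is at most the heap height. That is why limiting the height, as in the paper's first result, bounds the worst-case work per operation.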
Taxonomy
Topics: Algorithms and Data Compression · Genomics and Phylogenetic Studies · DNA and Biological Computing
