Top-K Color Queries for Document Retrieval
Marek Karpinski, Yakov Nekrich

TL;DR
The paper introduces an optimal data structure for top-K color queries in arrays, enabling efficient retrieval of the highest-priority colors within a range, with applications to document retrieval.
Contribution
It presents a novel, optimal data structure for top-K color queries that is both space-efficient and provides worst-case optimal query time.
Findings
Queries answered in O(K) time
Data structure uses O(N log σ) bits
Applications to document retrieval problems
Abstract
In this paper we describe a new efficient (in fact optimal) data structure for the {\em top- color problem}. Each element of an array is assigned a color with priority . For a query range and a value , we have to report colors with the highest priorities among all colors that occur in , sorted in reverse order by their priorities. We show that such queries can be answered in time using an bits data structure, where is the number of elements in the array and is the number of colors. Thus our data structure is asymptotically optimal with respect to the worst-case query time and space. As an immediate application of our results, we obtain optimal time solutions for several document retrieval problems. The method of the paper could be also of independent interest.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · Advanced Image and Video Retrieval Techniques · Graph Labeling and Dimension Problems
