Efficient Direct-Access Ranked Retrieval
Mohsen Dehghankar, Raghav Mittal, Suraj Shetiya, Abolfazl Asudeh, Gautam Das

TL;DR
This paper introduces efficient algorithms for direct-access ranked retrieval in high-dimensional datasets, balancing query speed and space complexity, and proposes a relaxed retrieval variant with practical scalability.
Contribution
It formalizes the DAR problem, proposes geometric and sampling-based algorithms, and introduces CSR and SRR for scalable, approximate ranked retrieval in large, high-dimensional datasets.
Findings
Logarithmic query time with geometric arrangements
Linear space algorithms based on ε-sampling
Scalable performance on datasets with millions of tuples and hundreds of dimensions
Abstract
We study the problem of Direct-Access Ranked Retrieval (DAR) for interactive data tooling, where evolving data exploration practices, combined with large-scale and high-dimensional datasets, create new challenges. DAR concerns the problem of enabling efficient access to arbitrary rank positions according to a ranking function, without enumerating all preceding tuples. To address this need, we formalize the DAR problem and propose a theoretically efficient algorithm based on geometric arrangements, achieving logarithmic query time. However, this method suffers from exponential space complexity in high dimensions. Therefore, we develop a second class of algorithms based on -sampling, which consume a linear space. Since exactly locating the tuple at a specific rank is challenging due to its connection to the range counting problem, we introduce a relaxed variant called…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Image and Video Retrieval Techniques · Information Retrieval and Search Behavior
