How to select the largest k elements from evolving data?
Qin Huang, Xingwu Liu, Xiaoming Sun, and Jialin Zhang

TL;DR
This paper studies the problem of selecting the top-k elements from dynamically changing data, identifying conditions for error-free solutions and extending results to broader models.
Contribution
It systematically addresses top-k selection in dynamic data, identifying a critical threshold and providing tight bounds, extending previous special-case studies.
Findings
Error-free top-k selection is possible when k=o(k*)
A tight lower bound on error for k=Ω(k*)
Most results extend to broader dynamic models
Abstract
In this paper we investigate the top--selection problem, i.e. determine the largest, second largest, ..., and the -th largest elements, in the dynamic data model. In this model the order of elements evolves dynamically over time. In each time step the algorithm can only probe the changes of data by comparing a pair of elements. Previously only two special cases were studied[2]: finding the largest element and the median; and sorting all elements. This paper systematically deals with and solves the problem almost completely. Specifically, we identify a critical point such that the top--selection problem can be solved error-free with probability if and only if . A lower bound of the error when is also determined, which actually is tight under some condition. On the other hand, it is shown that the top--set problem, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Digital Image Processing Techniques · Algorithms and Data Compression
