RRR: Rank-Regret Representative
Abolfazl Asudeh, Azade Nazi, Nan Zhang, Gautam Das, H. V., Jagadish

TL;DR
This paper introduces the concept of rank-regret representatives, small data subsets ensuring top-ranked items across all ranking functions, addressing user understanding and computational challenges with geometric and approximation algorithms.
Contribution
It proposes a novel rank-based regret measure and develops efficient algorithms to find minimal subsets with guaranteed top-ranked items for all ranking functions.
Findings
Small subsets with low rank-regret can be efficiently identified.
Geometric bounds enable effective approximation algorithms.
Experimental results show practical applicability on real datasets.
Abstract
Selecting the best items in a dataset is a common task in data exploration. However, the concept of "best" lies in the eyes of the beholder: different users may consider different attributes more important, and hence arrive at different rankings. Nevertheless, one can remove "dominated" items and create a "representative" subset of the data set, comprising the "best items" in it. A Pareto-optimal representative is guaranteed to contain the best item of each possible ranking, but it can be almost as big as the full data. Representative can be found if we relax the requirement to include the best item for every possible user, and instead just limit the users' "regret". Existing work defines regret as the loss in score by limiting consideration to the representative instead of the full data set, for any chosen ranking function. However, the score is often not a meaningful number and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
