Memory-Efficient RkNN Retrieval by Nonlinear k-Distance Approximation

Sandra Obermeier; Max Berrendorf; Peer Kr\"oger

arXiv:2011.01773·cs.DB·November 4, 2020

Memory-Efficient RkNN Retrieval by Nonlinear k-Distance Approximation

Sandra Obermeier, Max Berrendorf, Peer Kr\"oger

PDF

1 Repo

TL;DR

This paper introduces a machine learning-based approach for more memory-efficient reverse k-nearest neighbor retrieval, addressing the limitations of linear approximation methods in real-world datasets with variable density.

Contribution

It proposes a nonlinear k-distance approximation framework that improves memory efficiency and performance in RkNN queries, especially under fixed memory constraints.

Findings

01

Significantly reduces index memory consumption.

02

Strongly decreases candidate set size.

03

Effectively handles datasets with changing density.

Abstract

The reverse k-nearest neighbor (RkNN) query is an established query type with various applications reaching from identifying highly influential objects over incrementally updating kNN graphs to optimizing sensor communication and outlier detection. State-of-the-art solutions exploit that the k-distances in real-world datasets often follow the power-law distribution, and bound them with linear lines in log-log space. In this work, we investigate this assumption and uncover that it is violated in regions of changing density, which we show are typical for real-life datasets. Towards a generic solution, we pose the estimation of k-distances as a regression problem. Thereby, we enable harnessing the power of the abundance of available Machine Learning models and profiting from their advancement. We propose a flexible approach which allows steering the performance-memory consumption…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sobermeier/nonlinear-kdist
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.