Probabilistic Counting in Uncertain Spatial Databases using Generating Functions
Andreas Z\"ufle

TL;DR
This paper introduces a generating function-based method for probabilistic counting in uncertain spatial databases, enabling efficient query processing despite the exponential complexity of possible data worlds.
Contribution
It presents a novel application of generating functions to efficiently perform probabilistic counting in uncertain spatial data, addressing NP-hard query processing challenges.
Findings
Effective probabilistic counting method demonstrated
Applicable to range, kNN, and distance ranking queries
Improves efficiency in uncertain spatial data analysis
Abstract
Location data is inherently uncertain for many reasons including 1) imprecise location measurements, 2) obsolete observations that are often interpolated, and 3) deliberate obfuscation to preserve location privacy. What makes handling uncertainty data challenging is the exponentially large number of possible worlds, which lies in O(2^N), for a database having N uncertain objects as it has been shown that general query processing in uncertain spatial data is NP-hard. Many applications using spatial data require counting the number of spatial objects within a region. An example is the k-Nearest Neighbor (kNN) query: Asking if an object A is a kNN of another object Q is equivalent to asking whether no more than k-1 objects are located inside the circle centered at Q having a radius equal to the distance between Q and A. For this problem of counting uncertain objects within a region, an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Geographic Information Systems Studies · Constraint Satisfaction and Optimization
