Probabilistic Top-k Dominating Queries in Distributed Uncertain Databases (Technical Report)
Niranjan Rai, Xiang Lian

TL;DR
This paper addresses the challenge of efficiently processing probabilistic top-k dominating queries on large-scale uncertain data in distributed environments, proposing a MapReduce framework with novel pruning and indexing strategies.
Contribution
It introduces a new distributed MapReduce framework for probabilistic top-k dominating queries on uncertain data, including pruning strategies and cost-model-based index distribution.
Findings
The proposed approach is efficient on real and synthetic datasets.
It effectively filters false alarms in distributed uncertain databases.
Experimental results demonstrate high effectiveness and scalability.
Abstract
In many real-world applications such as business planning and sensor data monitoring, one important, yet challenging, the task is to rank objects(e.g., products, documents, or spatial objects) based on their ranking scores and efficiently return those objects with the highest scores. In practice, due to the unreliability of data sources, many real-world objects often contain noises and are thus imprecise and uncertain. In this paper, we study the problem of probabilistic top-k dominating(PTD) query on such large-scale uncertain data in a distributed environment, which retrieves k uncertain objects from distributed uncertain databases(on multiple distributed servers), having the largest ranking scores with high confidences. In order to efficiently tackle the distributed PTD problem, we propose a MapReduce framework for processing distributed PTD queries over distributed uncertain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Database Systems and Queries · Automated Road and Building Extraction
