Faster Relational Algorithms Using Geometric Data Structures
Aryan Esmailpour, Stavros Sintos

TL;DR
This paper introduces a novel geometric data structure, RBBD tree, that accelerates relational data optimization tasks like clustering by avoiding full join materialization, achieving significant speedups over existing methods.
Contribution
The paper presents the RBBD tree, a randomized geometric data structure for relational data, enabling faster algorithms for optimization tasks without full join computation.
Findings
Improved clustering algorithms with k-factor speedup.
Maintains approximation guarantees while reducing runtime.
Applicable to various relational optimization problems.
Abstract
Optimization tasks over relational data, such as clustering, often suffer from the prohibitive cost of join operations, which are necessary to access the full dataset. While geometric data structures like BBD trees yield fast approximation algorithms in the standard computational setting, their application to relational data remains unclear due to the size of the join output. In this paper, we introduce a framework that leverages geometric insights to design faster algorithms when the data is stored as the results of a join query in a relational database. Our core contribution is the development of the RBBD tree, a randomized variant of the BBD tree tailored for relational settings. Instead of completely constructing the RBBD tree, by leveraging efficient sampling and counting techniques over relational joins, we enable on-the-fly efficient expansion of the RBBD tree, maintaining only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Clustering Algorithms Research · Topological and Geometric Data Analysis
