Multi-Attribute Selectivity Estimation Using Deep Learning
Shohedul Hasan, Saravanan Thirumuruganathan, Jees Augustine, Nick, Koudas, Gautam Das

TL;DR
This paper explores deep learning methods for multi-attribute selectivity estimation in databases, introducing unsupervised density estimation and supervised prediction approaches to improve accuracy and efficiency.
Contribution
It presents novel deep learning techniques for selectivity estimation, addressing practical challenges and demonstrating superior performance over traditional methods.
Findings
Deep density estimation effectively models joint attribute distributions.
Supervised deep learning accurately predicts query selectivity.
Proposed methods outperform existing techniques in speed and accuracy.
Abstract
Selectivity estimation - the problem of estimating the result size of queries - is a fundamental problem in databases. Accurate estimation of query selectivity involving multiple correlated attributes is especially challenging. Poor cardinality estimates could result in the selection of bad plans by the query optimizer. We investigate the feasibility of using deep learning based approaches for both point and range queries and propose two complementary approaches. Our first approach considers selectivity as an unsupervised deep density estimation problem. We successfully introduce techniques from neural density estimation for this purpose. The key idea is to decompose the joint distribution into a set of tractable conditional probability distributions such that they satisfy the autoregressive property. Our second approach formulates selectivity estimation as a supervised deep learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Data Management and Algorithms · Advanced Database Systems and Queries
