TL;DR
QSPN is a unified model for cardinality estimation that combines data distribution and query workload using Sum-Product Networks, achieving high accuracy, low inference time, and reduced storage overhead in database systems.
Contribution
The paper introduces QSPN, a novel model extending Sum-Product Networks with new node types to jointly optimize accuracy, inference speed, and storage for cardinality estimation.
Findings
QSPN outperforms state-of-the-art methods in accuracy.
QSPN achieves faster inference times.
QSPN reduces storage overhead significantly.
Abstract
Cardinality estimation is a fundamental component in database systems, crucial for generating efficient execution plans. Despite advancements in learning-based cardinality estimation, existing methods may struggle to simultaneously optimize the key criteria: estimation accuracy, inference time, and storage overhead, limiting their practical applicability in real-world database environments. This paper introduces QSPN, a unified model that integrates both data distribution and query workload. QSPN achieves high estimation accuracy by modeling data distribution using the simple yet effective Sum-Product Network (SPN) structure. To ensure low inference time and reduce storage overhead, QSPN further partitions columns based on query access patterns. We formalize QSPN as a tree-based structure that extends SPNs by introducing two new node types: QProduct and QSplit. This paper studies the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
