A Lightweight Learned Cardinality Estimation Model
Yaoyu Zhu, Jintao Zhang, Guoliang Li, and Jianhua Feng

TL;DR
This paper introduces CoDe, a lightweight, data-driven cardinality estimation model that combines covering design and tensor decomposition to achieve high accuracy and efficiency in query result prediction.
Contribution
The paper presents a novel approach using covering design and tensor decomposition for fast, accurate cardinality estimation in databases, outperforming existing methods.
Findings
Achieves state-of-the-art accuracy and efficiency in cardinality estimation.
Estimates over half of queries with absolute accuracy across datasets.
Introduces innovative algorithms for distribution selection and modeling.
Abstract
Cardinality estimation is a fundamental task in database management systems, aiming to predict query results accurately without executing the queries. However, existing techniques either achieve low estimation accuracy or incur high inference latency. Simultaneously achieving high speed and accuracy becomes critical for the cardinality estimation problem. In this paper, we propose a novel data-driven approach called CoDe (Covering with Decompositions) to address this problem. CoDe employs the concept of covering design, which divides the table into multiple smaller, overlapping segments. For each segment, CoDe utilizes tensor decomposition to accurately model its data distribution. Moreover, CoDe introduces innovative algorithms to select the best-fitting distributions for each query, combining them to estimate the final result. By employing multiple models to approximate distributions,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
