A Lightweight Learned Cardinality Estimation Model

Yaoyu Zhu; Jintao Zhang; Guoliang Li; and Jianhua Feng

arXiv:2508.09602·cs.DB·August 14, 2025

A Lightweight Learned Cardinality Estimation Model

Yaoyu Zhu, Jintao Zhang, Guoliang Li, and Jianhua Feng

PDF

TL;DR

This paper introduces CoDe, a lightweight, data-driven cardinality estimation model that combines covering design and tensor decomposition to achieve high accuracy and efficiency in query result prediction.

Contribution

The paper presents a novel approach using covering design and tensor decomposition for fast, accurate cardinality estimation in databases, outperforming existing methods.

Findings

01

Achieves state-of-the-art accuracy and efficiency in cardinality estimation.

02

Estimates over half of queries with absolute accuracy across datasets.

03

Introduces innovative algorithms for distribution selection and modeling.

Abstract

Cardinality estimation is a fundamental task in database management systems, aiming to predict query results accurately without executing the queries. However, existing techniques either achieve low estimation accuracy or incur high inference latency. Simultaneously achieving high speed and accuracy becomes critical for the cardinality estimation problem. In this paper, we propose a novel data-driven approach called CoDe (Covering with Decompositions) to address this problem. CoDe employs the concept of covering design, which divides the table into multiple smaller, overlapping segments. For each segment, CoDe utilizes tensor decomposition to accurately model its data distribution. Moreover, CoDe introduces innovative algorithms to select the best-fitting distributions for each query, combining them to estimate the final result. By employing multiple models to approximate distributions,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.