AI-Driven Research for Databases

Audrey Cheng; Harald Ng; Aaron Kabcenell; Peter Bailis; Matei Zaharia; Lin Ma; Xiao Shi; Ion Stoica

arXiv:2604.06566·cs.DB·April 9, 2026

AI-Driven Research for Databases

Audrey Cheng, Harald Ng, Aaron Kabcenell, Peter Bailis, Matei Zaharia, Lin Ma, Xiao Shi, Ion Stoica

PDF

TL;DR

This paper introduces an automated approach using AI to design evaluators that assess database system optimizations, enabling the discovery of novel, high-performance algorithms through co-evolution.

Contribution

It proposes a co-evolution method for automating evaluator design, overcoming the evaluation bottleneck in AI-driven research for databases.

Findings

01

Automated evaluators enabled discovery of algorithms with up to 6.8x lower latency.

02

Co-evolving evaluators with solutions improves optimization effectiveness.

03

Demonstrated success in buffer management, query rewriting, and index selection.

Abstract

As the complexity of modern workloads and hardware increasingly outpaces human research and engineering capacity, existing methods for database performance optimization struggle to keep pace. To address this gap, a new class of techniques, termed AI-Driven Research for Systems (ADRS), uses large language models to automate solution discovery. This approach shifts optimization from manual system design to automated code generation. The key obstacle, however, in applying ADRS is the evaluation pipeline. Since these frameworks rapidly generate hundreds of candidates without human supervision, they depend on fast and accurate feedback from evaluators to converge on effective solutions. Building such evaluators is especially difficult for complex database systems. To enable the practical application of ADRS in this domain, we propose automating the design of evaluators by co-evolving them…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.