Gen-DBA: Generative Database Agents
Yeasir Rayhan, Walid G. Aref

TL;DR
Gen-DBA introduces a versatile foundation model designed to optimize databases across diverse environments, overcoming the limitations of specialized ML models by offering broader generalization and adaptability.
Contribution
This paper proposes the concept and initial design of Gen-DBA, a unified foundation model for database optimization with agentic capabilities, addressing current ML4DB limitations.
Findings
Identifies limitations of narrow ML models in database optimization.
Proposes a vision and sketch design for a general-purpose foundation model.
Highlights research challenges for developing Gen-DBA.
Abstract
Leveraging Machine Learning to optimize database systems, referred to as Machine Learning for Databases (ML4DB, for short), dates back to the early 1990s, spanning indexing techniques, selectivity estimation, and query optimization. However, the idea has gained mainstream traction following the introduction of learned indexes in 2018, triggering a surge of research spanning learned indexes and cardinality estimators to learned query optimizers, storage layout design, resource management, and database tuning. The current ML4DB optimization landscape is dominated by narrow specialist ML models that are small and are trained on limited training data. Each specialist ML model targets a single database learning task on a fixed database engine, hardware platform, query workload, and optimization objective. As a result, they fall short in real-world settings, where these factors can vary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Big Data and Digital Economy · Multimodal Machine Learning Applications
