To GPU or Not to GPU: Vector Search in Relational Engines
Vasilis Mageirakos, Joel Andr\'e, Marko Kabi\'c, Bowen Wu, Yannis Chronis, Gustavo Alonso

TL;DR
This paper investigates whether integrating GPU-based vector search in relational database engines improves performance, through extending benchmarks, developing a modular engine, and extensive experiments comparing CPU and GPU deployments.
Contribution
It introduces a modular execution engine for SQL+VS queries, extends benchmarks with vector data, and provides insights on optimizing GPU integration for vector search in databases.
Findings
Relational components benefit significantly from GPU execution.
Data and index movement can negate GPU advantages in vector search.
Optimized index organization enhances GPU-based vector search performance.
Abstract
Vector search (VS) is now available in most database engines. However, while vector search is a common feature in AI/ML/LLMs where the dominant computing platforms are GPUs, existing database engines operate on CPUs even when implementing vector search. This raises the question of whether integrating vector processing on GPUs as part of the engine would be a better design. In this paper, we explore this question in detail. First, we extend the TPC-H benchmark with vector data (from text and images) and propose a number of representative SQL+VS queries. Second, we develop a modular execution engine that can run SQL+VS queries across CPU and GPU. Third, we perform extensive experiments on a number of deployments: running the SQL+VS queries across CPU and/or GPU, with data residing in CPU or GPU memory, with existing indices and novel, optimized versions, as well as across different GPUs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
