To GPU or Not to GPU: Vector Search in Relational Engines

Vasilis Mageirakos; Joel Andr\'e; Marko Kabi\'c; Bowen Wu; Yannis Chronis; Gustavo Alonso

arXiv:2605.15957·cs.DB·May 18, 2026

To GPU or Not to GPU: Vector Search in Relational Engines

Vasilis Mageirakos, Joel Andr\'e, Marko Kabi\'c, Bowen Wu, Yannis Chronis, Gustavo Alonso

PDF

TL;DR

This paper investigates whether integrating GPU-based vector search in relational database engines improves performance, through extending benchmarks, developing a modular engine, and extensive experiments comparing CPU and GPU deployments.

Contribution

It introduces a modular execution engine for SQL+VS queries, extends benchmarks with vector data, and provides insights on optimizing GPU integration for vector search in databases.

Findings

01

Relational components benefit significantly from GPU execution.

02

Data and index movement can negate GPU advantages in vector search.

03

Optimized index organization enhances GPU-based vector search performance.

Abstract

Vector search (VS) is now available in most database engines. However, while vector search is a common feature in AI/ML/LLMs where the dominant computing platforms are GPUs, existing database engines operate on CPUs even when implementing vector search. This raises the question of whether integrating vector processing on GPUs as part of the engine would be a better design. In this paper, we explore this question in detail. First, we extend the TPC-H benchmark with vector data (from text and images) and propose a number of representative SQL+VS queries. Second, we develop a modular execution engine that can run SQL+VS queries across CPU and GPU. Third, we perform extensive experiments on a number of deployments: running the SQL+VS queries across CPU and/or GPU, with data residing in CPU or GPU memory, with existing indices and novel, optimized versions, as well as across different GPUs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.