# Research Challenges in Relational Database Management Systems for LLM Queries

**Authors:** Kerem Akillioglu, Anurag Chakraborty, Sairaj Voruganti, M. Tamer \"Ozsu

arXiv: 2508.20912 · 2025-08-29

## TL;DR

This paper explores the challenges and limitations of integrating large language models with relational database systems, highlighting issues in output structuring, resource use, and query planning, with initial solutions showing promising improvements.

## Contribution

It provides an early analysis of open-source and enterprise LLM-DBMS integrations, identifying key challenges and proposing initial solutions for better scalability and efficiency.

## Key findings

- Identified core issues in LLM-DBMS integration: output structuring, resource optimization, query planning.
- Implemented initial solutions that improved handling of LLM-powered SQL queries.
- Early results suggest tighter LLM-DBMS integration enhances scalability and performance.

## Abstract

Large language models (LLMs) have become essential for applications such as text summarization, sentiment analysis, and automated question-answering. Recently, LLMs have also been integrated into relational database management systems to enhance querying and support advanced data processing. Companies such as Amazon, Databricks, Google, and Snowflake offer LLM invocation directly within SQL, denoted as LLM queries, to boost data insights. However, open-source solutions currently have limited functionality and poor performance. In this work, we present an early exploration of two open-source systems and one enterprise platform, using five representative queries to expose functional, performance, and scalability limits in today's SQL-invoked LLM integrations. We identify three main issues: enforcing structured outputs, optimizing resource utilization, and improving query planning. We implemented initial solutions and observed improvements in accommodating LLM powered SQL queries. These early gains demonstrate that tighter integration of LLM+DBMS is the key to scalable and efficient processing of LLM queries.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2508.20912/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/2508.20912/full.md

## References

52 references — full list in the complete paper: https://tomesphere.com/paper/2508.20912/full.md

---
Source: https://tomesphere.com/paper/2508.20912