Efficient Learned Query Execution over Text and Tables [Technical Report]
Matthias Urban, Carsten Binnig

TL;DR
ELEET introduces an efficient engine that integrates text and tables using learned multimodal operators, enabling fast, accurate query execution without relying on large language models like GPT-4.
Contribution
ELEET proposes a novel architecture and training method for a small language model that efficiently extracts structured data from text to enable multimodal query processing.
Findings
ELEET achieves up to 575x faster query execution compared to LLM-based baselines.
ELEET maintains high accuracy in extracting structured data from text.
The approach significantly reduces runtime overhead for multimodal data queries.
Abstract
In this paper, we present ELEET, a novel execution engine that allows one to seamlessly query and process text as a first-class citizen along with tables. To enable such a seamless integration of text and tables, ELEET leverages learned multi-modal operators (MMOps) such as joins and unions that seamlessly combine structured with unstructured textual data. While large language models (LLM) such as GPT-4 are interesting candidates to enable such learned multimodal operations, we deliberately do not follow this trend to enable MMOps, since it would result in high overhead at query runtime. Instead, to enable MMOps, ELEET comes with a more efficient small language model (SLM) that is targeted to extract structured data from text. Thanks to our novel architecture and pre-training procedure, the ELEET-model enables high-accuracy extraction with low overheads. In our evaluation, we compare…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Advanced Database Systems and Queries
