Beyond Single-Modal Analytics: A Framework for Integrating Heterogeneous LLM-Based Query Systems for Multi-Modal Data
Ruyu Li, Tinghui Zhang, Haodi Ma, Daisy Zhe Wang, Yifan Wang

TL;DR
This paper presents Meta Engine, a unified framework that integrates diverse LLM-based query systems to effectively handle multi-modal data, overcoming fragmentation and performance trade-offs in semantic querying.
Contribution
The paper introduces Meta Engine, a novel architecture that unifies heterogeneous LLM-based query systems for multi-modal data, improving integration and performance.
Findings
Meta Engine achieves 3-6x higher F1 scores than baselines.
It effectively integrates specialized systems for different modalities.
Performance gains are up to ~24x on certain datasets.
Abstract
With the increasing use of multi-modal data, semantic query has become more and more demanded in data management systems, which is an important way to access and analyze multi-modal data. As unstructured data, most information of multi-modal data (text, image, video, etc.) hides in the semantics, which cannot be accessed by traditional database queries like SQL. Given the power of Large Language Models (LLMs) in understanding semantics and processing natural language, in recent years several LLM-based semantic query systems have been proposed to support semantic querying over unstructured data. However, this rapid growth has produced a fragmented ecosystem. Applications face significant integration challenges due to (1) disparate APIs of different semantic query systems and (2) a fundamental trade-off between specialization and generality. Many semantic query systems are highly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Advanced Database Systems and Queries · Natural Language Processing Techniques
