Beyond Relational: Semantic-Aware Multi-Modal Analytics with LLM-Native Query Optimization
Junhao Zhu, Lu Chen, Xiangyu Ke, Ziquan Fang, Tianyi Li, Yunjun Gao, Christian S. Jensen

TL;DR
Nirvana is a semantic-aware multi-modal data analytics framework leveraging LLMs, featuring advanced query optimization and cost-aware operator selection, significantly improving efficiency and scalability over existing systems.
Contribution
It introduces a novel semantic query optimizer and cost-aware physical plan selection tailored for LLM-driven multi-modal analytics, surpassing traditional relational approaches.
Findings
Reduces end-to-end runtime by up to 85%.
Decreases processing costs by 76% on average.
Outperforms state-of-the-art systems in efficiency and scalability.
Abstract
Multi-modal analytical processing has the potential to transform applications in e-commerce, healthcare, entertainment, and beyond. However, real-world adoption remains elusive due to the limited ability of traditional relational query operators to capture query semantics. The emergence of foundation models, particularly the large language models (LLMs), opens up new opportunities to develop flexible, semantic-aware data analytics systems that transcend the relational paradigm. We present Nirvana, a multi-modal data analytics framework that incorporates programmable semantic operators while leveraging both logical and physical query optimization strategies, tailored for LLM-driven semantic query processing. Nirvana addresses two key challenges. First, it features an agentic logical optimizer that uses natural language-specified transformation rules and random-walk-based search to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Big Data and Digital Economy · Semantic Web and Ontologies
