MoMQ: Mixture-of-Experts Enhances Multi-Dialect Query Generation across Relational and Non-Relational Databases
Zhisheng Lin, Yifu Liu, Zhiling Luo, Jinyang Gao, Yu Li

TL;DR
MoMQ introduces a Mixture-of-Experts framework that significantly improves multi-dialect query generation across various database types by addressing dialect differences and data imbalance.
Contribution
The paper presents MoMQ, a novel multi-dialect query generation framework using Mixture-of-Experts to handle dialect-specific knowledge and data imbalance effectively.
Findings
MoMQ outperforms baseline models in multi-dialect query generation tasks.
MoMQ demonstrates robustness in resource-imbalanced scenarios.
A new benchmark covers relational and non-relational databases for multi-dialect query generation.
Abstract
The improvement in translating natural language to structured query language (SQL) can be attributed to the advancements in large language models (LLMs). Open-source LLMs, tailored for specific database dialects such as MySQL, have shown great performance. However, cloud service providers are looking for a unified database manager service (e.g., Cosmos DB from Azure, Amazon Aurora from AWS, Lindorm from AlibabaCloud) that can support multiple dialects. This requirement has led to the concept of multi-dialect query generation, which presents challenges to LLMs. These challenges include syntactic differences among dialects and imbalanced data distribution across multiple dialects. To tackle these challenges, we propose MoMQ, a novel Mixture-of-Experts-based multi-dialect query generation framework across both relational and non-relational databases. MoMQ employs a dialect expert group for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Speech and dialogue systems · Data Mining Algorithms and Applications
Methodstravel james
