Integrating Multi-Head Convolutional Encoders with Cross-Attention for Improved SPARQL Query Translation
Yi-Hui Chen, Eric Jui-Lin Lu, Kwan-Ho Cheng

TL;DR
This paper introduces a Multi-Head Conv encoder with cross-attention for SPARQL query translation, achieving superior performance in KGQA systems with limited resources.
Contribution
It proposes a novel Multi-Head Conv encoder that combines convolutional layers with multi-head attention, improving translation accuracy in neural SPARQL generation.
Findings
Achieved 76.52% BLEU-1 on QALD-9 dataset.
Achieved 83.37% BLEU-1 on LC-QuAD-1.0 dataset.
Outperformed other KGQA systems in end-to-end experiments.
Abstract
The main task of the KGQA system (Knowledge Graph Question Answering) is to convert user input questions into query syntax (such as SPARQL). With the rise of modern popular encoders and decoders like Transformer and ConvS2S, many scholars have shifted the research direction of SPARQL generation to the Neural Machine Translation (NMT) architecture or the generative AI field of Text-to-SPARQL. In NMT-based QA systems, the system treats knowledge base query syntax as a language. It uses NMT-based translation models to translate natural language questions into query syntax. Scholars use popular architectures equipped with cross-attention, such as Transformer, ConvS2S, and BiLSTM, to train translation models for query syntax. To achieve better query results, this paper improved the ConvS2S encoder and added multi-head attention from the Transformer, proposing a Multi-Head Conv encoder (MHC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies
MethodsAttention Is All You Need · Adam · Layer Normalization · Position-Wise Feed-Forward Layer · Dense Connections · Tanh Activation · Residual Connection · Byte Pair Encoding · Absolute Position Encodings · Softmax
