TL;DR
AMUSE introduces a multilingual semantic parsing pipeline that maps natural language questions to SPARQL queries using probabilistic models, overcoming lexical gaps with innovative translation and embedding techniques, and demonstrating effectiveness across multiple languages.
Contribution
It presents the first multilingual QALD pipeline that learns to convert questions into logical forms using probabilistic inference and language-independent representations.
Findings
Effective cross-lingual performance on QALD-6 dataset
Novel combination of machine translation and word embeddings
Overcomes lexical gaps in multilingual semantic parsing
Abstract
The task of answering natural language questions over RDF data has received wide interest in recent years, in particular in the context of the series of QALD benchmarks. The task consists of mapping a natural language question to an executable form, e.g. SPARQL, so that answers from a given KB can be extracted. So far, most systems proposed are i) monolingual and ii) rely on a set of hard-coded rules to interpret questions and map them into a SPARQL query. We present the first multilingual QALD pipeline that induces a model from training data for mapping a natural language question into logical form as probabilistic inference. In particular, our approach learns to map universal syntactic dependency representations to a language-independent logical form based on DUDES (Dependency-based Underspecified Discourse Representation Structures) that are then mapped to a SPARQL query as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
