Pivot Through English: Reliably Answering Multilingual Questions without Document Retrieval
Ivan Montero, Shayne Longpre, Ni Lao, Andrew J. Frank, Christopher, DuBois

TL;DR
This paper introduces a method to answer questions in low-resource languages by leveraging strong English question-answering models and a pivoting approach, avoiding the need for document retrieval or language-specific supervision.
Contribution
The paper proposes RM-MIPS, a novel pivot-based retrieval method that outperforms existing baselines in multilingual question answering without target language training data.
Findings
RM-MIPS outperforms baselines by 2.7% on XQuAD and 6.2% on MKQA
Effective in low-resource languages with distractors and distribution shifts
Enables rapid, off-the-shelf multilingual question answering without additional training
Abstract
Existing methods for open-retrieval question answering in lower resource languages (LRLs) lag significantly behind English. They not only suffer from the shortcomings of non-English document retrieval, but are reliant on language-specific supervision for either the task or translation. We formulate a task setup more realistic to available resources, that circumvents document retrieval to reliably transfer knowledge from English to lower resource languages. Assuming a strong English question answering model or database, we compare and analyze methods that pivot through English: to map foreign queries to English and then English answers back to target language answers. Within this task setup we propose Reranked Multilingual Maximal Inner Product Search (RM-MIPS), akin to semantic similarity retrieval over the English training set with reranking, which outperforms the strongest baselines…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques
