Learning to Translate for Multilingual Question Answering

Ferhan Ture; Elizabeth Boschee

arXiv:1609.08210·cs.CL·September 28, 2016

Learning to Translate for Multilingual Question Answering

Ferhan Ture, Elizabeth Boschee

PDF

TL;DR

This paper introduces a learn-to-translate approach for multilingual question answering that optimally combines different translation methods and directions, outperforming traditional translation-based methods on a multilingual forum dataset.

Contribution

It proposes a novel model that learns optimal feature weights for multiple translation methods and directions in multilingual QA, improving over standard translation strategies.

Findings

01

Learn-to-translate approach outperforms baseline (p<0.05)

02

Effective in multilingual forum dataset with English, Arabic, Chinese

03

Combines multiple translation methods and directions for better QA performance

Abstract

In multilingual question answering, either the question needs to be translated into the document language, or vice versa. In addition to direction, there are multiple methods to perform the translation, four of which we explore in this paper: word-based, 10-best, context-based, and grammar-based. We build a feature for each combination of translation direction and method, and train a model that learns optimal feature weights. On a large forum dataset consisting of posts in English, Arabic, and Chinese, our novel learn-to-translate approach was more effective than a strong baseline (p<0.05): translating all text into English, then training a classifier based only on English (original or translated) text.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.