Filling Knowledge Gaps in a Broad-Coverage Machine Translation System
Kevin Knight, Ishwar Chander, Matthew Haines, Vasileios, Hatzivassiloglou, Eduard Hovy, Masayo Iida, Steve K. Luk, Richard Whitney,, Kenji Yamada (USC/Information Sciences Institute)

TL;DR
This paper discusses methods to fill knowledge gaps in a broad-coverage Japanese-English machine translation system, enhancing its robustness and coverage by leveraging statistical techniques to improve translation quality across diverse domains.
Contribution
It introduces techniques for filling knowledge gaps in KBMT systems, demonstrating their effectiveness on a broad-coverage Japanese-English translation system.
Findings
Improved translation quality across multiple domains
Effective use of statistical techniques for knowledge gap filling
Enhanced system robustness in the absence of definitive knowledge
Abstract
Knowledge-based machine translation (KBMT) techniques yield high quality in domains with detailed semantic models, limited vocabulary, and controlled input grammar. Scaling up along these dimensions means acquiring large knowledge resources. It also means behaving reasonably when definitive knowledge is not yet available. This paper describes how we can fill various KBMT knowledge gaps, often using robust statistical techniques. We describe quantitative and qualitative results from JAPANGLOSS, a broad-coverage Japanese-English MT system.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
