Filling Knowledge Gaps in a Broad-Coverage Machine Translation System

Kevin Knight; Ishwar Chander; Matthew Haines; Vasileios; Hatzivassiloglou; Eduard Hovy; Masayo Iida; Steve K. Luk; Richard Whitney,; Kenji Yamada (USC/Information Sciences Institute)

arXiv:cmp-lg/9506009·cmp-lg·February 3, 2008·44 cites

Filling Knowledge Gaps in a Broad-Coverage Machine Translation System

Kevin Knight, Ishwar Chander, Matthew Haines, Vasileios, Hatzivassiloglou, Eduard Hovy, Masayo Iida, Steve K. Luk, Richard Whitney,, Kenji Yamada (USC/Information Sciences Institute)

PDF

Open Access

TL;DR

This paper discusses methods to fill knowledge gaps in a broad-coverage Japanese-English machine translation system, enhancing its robustness and coverage by leveraging statistical techniques to improve translation quality across diverse domains.

Contribution

It introduces techniques for filling knowledge gaps in KBMT systems, demonstrating their effectiveness on a broad-coverage Japanese-English translation system.

Findings

01

Improved translation quality across multiple domains

02

Effective use of statistical techniques for knowledge gap filling

03

Enhanced system robustness in the absence of definitive knowledge

Abstract

Knowledge-based machine translation (KBMT) techniques yield high quality in domains with detailed semantic models, limited vocabulary, and controlled input grammar. Scaling up along these dimensions means acquiring large knowledge resources. It also means behaving reasonably when definitive knowledge is not yet available. This paper describes how we can fill various KBMT knowledge gaps, often using robust statistical techniques. We describe quantitative and qualitative results from JAPANGLOSS, a broad-coverage Japanese-English MT system.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification