Phrase database Approach to structural and semantic disambiguation in English-Korean Machine Translation
Myong-Chol Pak

TL;DR
This paper proposes a bilingual phrase database to improve structural and semantic disambiguation in English-Korean machine translation, addressing issues caused by idiomatic and non-grammatical phrases.
Contribution
It introduces a new approach using a phrase database to resolve ambiguities in EKMT, including methods for database construction and application.
Findings
Enhanced translation accuracy for idiomatic phrases
Reduction in structural ambiguities during parsing
Improved semantic clarity in translations
Abstract
In machine translation it is common phenomenon that machine-readable dictionaries and standard parsing rules are not enough to ensure accuracy in parsing and translating English phrases into Korean language, which is revealed in misleading translation results due to consequent structural and semantic ambiguities. This paper aims to suggest a solution to structural and semantic ambiguities due to the idiomaticity and non-grammaticalness of phrases commonly used in English language by applying bilingual phrase database in English-Korean Machine Translation (EKMT). This paper firstly clarifies what the phrase unit in EKMT is based on the definition of the English phrase, secondly clarifies what kind of language unit can be the target of the phrase database for EKMT, thirdly suggests a way to build the phrase database by presenting the format of the phrase database with examples, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Translation Studies and Practices
