TL;DR
This paper introduces a hybrid Hindi-English machine translation system combining phrase-based SMT, EBMT, and RBMT to improve translation quality, especially for ambiguous and idiomatic sentences.
Contribution
It presents a novel hybrid data-driven MT approach that outperforms individual SMT, EBMT, and RBMT systems by integrating their strengths.
Findings
Hybrid MT outperforms baseline systems in fluency and accuracy.
The system handles ambiguous sentences and idioms better than existing models.
Comparison shows superior performance over Google, BING, and Babylonian translators.
Abstract
In this paper, an extended combined approach of phrase based statistical machine translation (SMT), example based MT (EBMT) and rule based MT (RBMT) is proposed to develop a novel hybrid data driven MT system capable of outperforming the baseline SMT, EBMT and RBMT systems from which it is derived. In short, the proposed hybrid MT process is guided by the rule based MT after getting a set of partial candidate translations provided by EBMT and SMT subsystems. Previous works have shown that EBMT systems are capable of outperforming the phrase-based SMT systems and RBMT approach has the strength of generating structurally and morphologically more accurate results. This hybrid approach increases the fluency, accuracy and grammatical precision which improve the quality of a machine translation system. A comparison of the proposed hybrid machine translation (HTM) model with renowned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
