Statistical Vs Rule Based Machine Translation; A Case Study on Indian   Language Perspective

Sreelekha S

arXiv:1708.04559·cs.CL·August 16, 2017

Statistical Vs Rule Based Machine Translation; A Case Study on Indian Language Perspective

Sreelekha S

PDF

Open Access

TL;DR

This study compares statistical and rule-based machine translation systems for English-Indian and Indian-Indian language pairs, showing SMT generally outperforms RBMT, especially with limited training data, in various translation scenarios.

Contribution

It provides a comprehensive five-way performance comparison and detailed analysis of SMT and RBMT systems for Indian languages, highlighting their relative strengths and weaknesses.

Findings

01

SMT outperforms RBMT in most scenarios

02

English-Indian SMT performs better than Indian-English SMT

03

SMT is more effective for Indian-Indian language translation

Abstract

In this paper we present our work on a case study between Statistical Machien Transaltion (SMT) and Rule-Based Machine Translation (RBMT) systems on English-Indian langugae and Indian to Indian langugae perspective. Main objective of our study is to make a five way performance compariosn; such as, a) SMT and RBMT b) SMT on English-Indian langugae c) RBMT on English-Indian langugae d) SMT on Indian to Indian langugae perspective e) RBMT on Indian to Indian langugae perspective. Through a detailed analysis we describe the Rule Based and the Statistical Machine Translation system developments and its evaluations. Through a detailed error analysis, we point out the relative strengths and weaknesses of both systems. The observations based on our study are: a) SMT systems outperforms RBMT b) In the case of SMT, English to Indian language MT systmes performs better than Indian to English…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques