Evaluating Machine Translation Models for English-Hindi Language Pairs: A Comparative Analysis

Ahan Prasannakumar Shetty

arXiv:2505.19604·cs.CL·May 27, 2025

Evaluating Machine Translation Models for English-Hindi Language Pairs: A Comparative Analysis

Ahan Prasannakumar Shetty

PDF

Open Access 1 Repo

TL;DR

This paper provides a comprehensive evaluation of various machine translation models for English-Hindi, using diverse datasets and metrics to analyze their effectiveness across general and specialized language domains.

Contribution

It offers a comparative analysis of multiple translation models with insights into their strengths and weaknesses for English-Hindi translation tasks.

Findings

01

Performance varies across models and metrics

02

Certain models excel in general translation

03

Specialized datasets reveal specific strengths and weaknesses

Abstract

Machine translation has become a critical tool in bridging linguistic gaps, especially between languages as diverse as English and Hindi. This paper comprehensively evaluates various machine translation models for translating between English and Hindi. We assess the performance of these models using a diverse set of automatic evaluation metrics, both lexical and machine learning-based metrics. Our evaluation leverages an 18000+ corpus of English Hindi parallel dataset and a custom FAQ dataset comprising questions from government websites. The study aims to provide insights into the effectiveness of different machine translation approaches in handling both general and specialized language domains. Results indicate varying performance levels across different metrics, highlighting strengths and areas for improvement in current translation systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ahanps/english-hindi-parallel-corpus
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsSparse Evolutionary Training