Evaluating Machine Translation Models for English-Hindi Language Pairs: A Comparative Analysis
Ahan Prasannakumar Shetty

TL;DR
This paper provides a comprehensive evaluation of various machine translation models for English-Hindi, using diverse datasets and metrics to analyze their effectiveness across general and specialized language domains.
Contribution
It offers a comparative analysis of multiple translation models with insights into their strengths and weaknesses for English-Hindi translation tasks.
Findings
Performance varies across models and metrics
Certain models excel in general translation
Specialized datasets reveal specific strengths and weaknesses
Abstract
Machine translation has become a critical tool in bridging linguistic gaps, especially between languages as diverse as English and Hindi. This paper comprehensively evaluates various machine translation models for translating between English and Hindi. We assess the performance of these models using a diverse set of automatic evaluation metrics, both lexical and machine learning-based metrics. Our evaluation leverages an 18000+ corpus of English Hindi parallel dataset and a custom FAQ dataset comprising questions from government websites. The study aims to provide insights into the effectiveness of different machine translation approaches in handling both general and specialized language domains. Results indicate varying performance levels across different metrics, highlighting strengths and areas for improvement in current translation systems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsSparse Evolutionary Training
