Assessing and Improving Punctuation Robustness in English-Marathi Machine Translation

Kaustubh Shivshankar Shejole; Sourabh Deoghare; Pushpak Bhattacharyya

arXiv:2601.09725·cs.CL·February 16, 2026

TL;DR

This paper introduces Viram, a benchmark for testing punctuation robustness in English-Marathi NMT, and evaluates strategies to improve translation quality when punctuation is missing or incorrect.

Contribution

The work presents Viram, a new benchmark dataset, and compares two remediation strategies (restore-then-translate and direct fine-tuning), showing that both handle punctuation errors better than existing models.

Findings

01. Both remediation strategies improve NMT performance significantly.

02. Current LLMs are less robust to punctuation errors than task-specific strategies.

03. The Viram benchmark exposes weaknesses in the punctuation robustness of existing NMT systems.

Abstract

Neural Machine Translation (NMT) systems rely heavily on explicit punctuation cues to resolve semantic ambiguities in a source sentence. Inputting user-generated sentences, which are likely to contain missing or incorrect punctuation, results in fluent but semantically disastrous translations. This work attempts to highlight and address the problem of punctuation robustness of NMT systems through English-to-Marathi translation. First, we introduce Viram, a human-curated diagnostic benchmark of 54 punctuation-ambiguous English-Marathi sentence pairs to stress-test existing NMT systems. Second, we evaluate two simple remediation strategies: cascade-based restore-then-translate and direct fine-tuning. Our experimental results and analysis demonstrate that both strategies yield substantial NMT performance improvements. Furthermore, we find that current…
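The cascade-based restore-then-translate strategy named in the abstract can be pictured as two stages chained together. The following is a minimal sketch under stated assumptions, not the authors' implementation: the function names `strip_punctuation`, `restore_then_translate`, `toy_restore`, and `toy_translate` are hypothetical, the example sentence is illustrative and not taken from Viram, and the "translator" only wraps its input in a placeholder tag instead of producing Marathi.

```python
import string
from typing import Callable

# Hypothetical aliases: each stage is any function from text to text.
Restorer = Callable[[str], str]
Translator = Callable[[str], str]


def strip_punctuation(sentence: str) -> str:
    """Simulate noisy user input by deleting ASCII punctuation marks."""
    table = str.maketrans("", "", string.punctuation)
    # Re-join on single spaces to collapse gaps left by removed marks.
    return " ".join(sentence.translate(table).split())


def restore_then_translate(
    sentence: str, restore: Restorer, translate: Translator
) -> str:
    """Cascade: repair punctuation first, then translate the repaired text."""
    return translate(restore(sentence))


# Toy stand-ins so the sketch runs without any model (assumptions only).
_KNOWN_FIXES = {"Lets eat Grandma": "Let's eat, Grandma."}


def toy_restore(sentence: str) -> str:
    """Look up a punctuated form; fall back to the input unchanged."""
    return _KNOWN_FIXES.get(sentence, sentence)


def toy_translate(sentence: str) -> str:
    """Placeholder translator: tags the text it was asked to translate."""
    return f"<mr>{sentence}</mr>"


noisy = strip_punctuation("Let's eat, Grandma.")
direct = toy_translate(noisy)
cascaded = restore_then_translate(noisy, toy_restore, toy_translate)
print(direct)
print(cascaded)
```

In this sketch the translator only ever sees the ambiguous, unpunctuated string on the direct path, whereas the cascade hands it the repaired sentence. Direct fine-tuning, the second strategy in the abstract, would instead change the translator itself so that it copes with the unpunctuated input; that is not modeled here.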

Code & Models

Datasets

thenlpresearcher/test_data_human_validated_eng_mar (dataset, 24 downloads)

Videos

Assessing and Improving Punctuation Robustness in English-Marathi Machine Translation

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification