A Representation Level Analysis of NMT Model Robustness to Grammatical Errors

Abderrahmane Issam; Yusuf Can Semerci; Jan Scholtes; Gerasimos Spanakis

arXiv:2505.21224·cs.CL·May 28, 2025

A Representation Level Analysis of NMT Model Robustness to Grammatical Errors

Abderrahmane Issam, Yusuf Can Semerci, Jan Scholtes, Gerasimos Spanakis

PDF

Open Access 1 Video

TL;DR

This paper investigates how neural machine translation models internally handle grammatical errors by analyzing their representations and attention mechanisms, revealing a process of error detection and correction within model layers.

Contribution

It introduces a representation-level analysis of NMT robustness, identifying the role of Robustness Heads and their impact on handling grammatical errors.

Findings

01

Encoder detects and corrects grammatical errors through internal representations.

02

Robustness Heads attend to linguistic units relevant to errors.

03

Fine-tuning enhances reliance on Robustness Heads for correction.

Abstract

Understanding robustness is essential for building reliable NLP systems. Unfortunately, in the context of machine translation, previous work mainly focused on documenting robustness failures or improving robustness. In contrast, we study robustness from a model representation perspective by looking at internal model representations of ungrammatical inputs and how they evolve through model layers. For this purpose, we perform Grammatical Error Detection (GED) probing and representational similarity analysis. Our findings indicate that the encoder first detects the grammatical error, then corrects it by moving its representation toward the correct form. To understand what contributes to this process, we turn to the attention mechanism where we identify what we term Robustness Heads. We find that Robustness Heads attend to interpretable linguistic units when responding to grammatical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

A Representation Level Analysis of NMT Model Robustness to Grammatical Errors· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis