Fabricator or dynamic translator?

Lisa Vasileva; Karin Sim

arXiv:2604.15165·cs.CL·April 17, 2026

Fabricator or dynamic translator?

Lisa Vasileva, Karin Sim

PDF

TL;DR

This paper explores the capabilities and challenges of large language models in machine translation, focusing on their overgeneration behaviors and strategies for detection in commercial applications.

Contribution

It introduces various strategies for identifying and understanding overgeneration in LLM-based translation, with practical insights for commercial deployment.

Findings

01

LLMs can produce diverse overgeneration types including explanations and confabulations.

02

Strategies for detecting overgeneration can improve translation reliability.

03

The work provides practical results from commercial setting experiments.

Abstract

LLMs are proving to be adept at machine translation although due to their generative nature they may at times overgenerate in various ways. These overgenerations are different from the neurobabble seen in NMT and range from LLM self-explanations, to risky confabulations, to appropriate explanations, where the LLM is able to act as a human translator would, enabling greater comprehension for the target audience. Detecting and determining the exact nature of the overgenerations is a challenging task. We detail different strategies we have explored for our work in a commercial setting, and present our results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.