Evaluating In-Context Translation with Synchronous Context-Free Grammar Transduction
Jackson Petty, Jaulie Goe, Tal Linzen

TL;DR
This paper investigates how well large language models can perform in-context translation of formal languages using synchronous context-free grammars, revealing limitations related to grammar complexity, sentence length, and linguistic differences.
Contribution
It introduces a formal framework for evaluating LLMs' in-context translation abilities using constructed grammars that mimic natural language features.
Findings
LLMs' translation accuracy drops with larger grammars and longer sentences.
Morphological and script differences significantly reduce translation performance.
Models often hallucinate, mistranslate, or omit words in translation errors.
Abstract
Low-resource languages pose a challenge for machine translation with large language models (LLMs), which require large amounts of training data. One potential way to circumvent this data dependence is to rely on LLMs' ability to use in-context descriptions of languages, like textbooks and dictionaries. To do so, LLMs must be able to infer the link between the languages' grammatical descriptions and the sentences in question. Here we isolate this skill using a formal analogue of the task: string transduction based on a formal grammar provided in-context. We construct synchronous context-free grammars which define pairs of formal languages designed to model particular aspects of natural language grammar, morphology, and written representation. Using these grammars, we measure how well LLMs can translate sentences from one formal language into another when given both the grammar and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
