Translation from the Information Bottleneck Perspective: an Efficiency Analysis of Spatial Prepositions in Bitexts

Antoine Taroni; Ludovic Moncla; Frederique Laforest

arXiv:2603.19924·cs.CL·April 7, 2026

Translation from the Information Bottleneck Perspective: an Efficiency Analysis of Spatial Prepositions in Bitexts

Antoine Taroni, Ludovic Moncla, Frederique Laforest

PDF

TL;DR

This paper models translation as an Information Bottleneck optimization, analyzing spatial prepositions across languages to reveal that human translation aligns with efficiency principles predicted by IB theory.

Contribution

It introduces a novel IB-based framework for analyzing translation, applying it to spatial prepositions across multiple languages using bitexts.

Findings

01

Translations of prepositions are closer to the IB optimal frontier than counterfactuals.

02

A predictive model of similarity judgements achieved a Spearman correlation of 0.78.

03

Evidence suggests human translators exhibit communicative efficiency pressures.

Abstract

Efficient communication requires balancing informativity and simplicity when encoding meanings. The Information Bottleneck (IB) framework captures this trade-off formally, predicting that natural language systems cluster near an optimal accuracy-complexity frontier. While supported in visual domains such as colour and motion, linguistic stimuli such as words in sentential context remain unexplored. We address this gap by framing translation as an IB optimisation problem, treating source sentences as stimuli and target sentences as compressed meanings. This allows IB analyses to be performed directly on bitexts rather than controlled naming experiments. We applied this to spatial prepositions across English, German and Serbian translations of a French novel. To estimate informativity, we conducted a pile-sorting pilot-study (N=35) and obtained similarity judgements of pairs of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.