TL;DR
This paper introduces a bilingual neural model for color reference generation that captures cross-lingual regularities and context-dependent language use, demonstrating improved pragmatics and semantic understanding in English and Chinese.
Contribution
It presents a new bilingual dataset and a neural model that learns language-specific color systems and cross-lingual influences without parallel data, advancing pragmatic language modeling.
Findings
Bilingual model exhibits more human-like context dependence.
Model learns language-specific color term systems and cross-lingual influences.
Can identify synonyms across languages using vector analogies.
Abstract
Contextual influences on language often exhibit substantial cross-lingual regularities; for example, we are more verbose in situations that require finer distinctions. However, these regularities are sometimes obscured by semantic and syntactic differences. Using a newly-collected dataset of color reference games in Mandarin Chinese (which we release to the public), we confirm that a variety of constructions display the same sensitivity to contextual difficulty in Chinese and English. We then show that a neural speaker agent trained on bilingual data with a simple multitask learning approach displays more human-like patterns of context dependence and is more pragmatically informative than its monolingual Chinese counterpart. Moreover, this is not at the expense of language-specific semantic understanding: the resulting speaker model learns the different basic color term systems of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
