Code-switching in text and speech challenges information-theoretic speaker design
Debasmita Bhattacharya, Marten van Schijndel

TL;DR
This paper investigates factors influencing code-switching in multilingual speech and text, revealing that switches are not solely driven by predictability but also serve communicative purposes.
Contribution
It demonstrates that code-switching is not only related to low predictability but also used for signaling, challenging purely speaker-driven theories.
Findings
Low primary language predictability correlates with code-switching.
English productions at switch points are less predictable than Chinese alternatives.
Code-switching serves purposes beyond speaker-driven predictability.
Abstract
In this work, we use language modeling to investigate the factors that influence insertional code-switching. Code-switching occurs when a speaker alternates between one language variety (the primary language) and another (the secondary language), and is widely observed in multilingual contexts. Recent work has shown that code-switching is often correlated with areas of low predictability in the primary language, but it is unclear whether low primary language predictability only makes the secondary language relatively easier to produce at code-switching points - that is, purely speaker-driven code-switching - or whether code-switching is additionally used by speakers for other purposes, for instance to signal the need for greater attention on the part of listeners. In this paper, we use bilingual Chinese-English online forum posts and transcripts of spontaneous Chinese-English speech to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
