Ancestor-to-Creole Transfer is Not a Walk in the Park
Heather Lent, Emanuele Bugliarello, Anders S{\o}gaard

TL;DR
This paper investigates the challenges of transferring language models from ancestor languages to Creoles, revealing unique two-phase training patterns and questioning the practicality of ancestry transfer.
Contribution
It uncovers a distinct two-phase pattern in Creole language modeling and challenges the effectiveness of standard ancestry transfer methods for Creoles.
Findings
Standard transfer methods do not facilitate ancestry transfer in Creoles.
Creoles exhibit a unique two-phase training pattern with perplexity drops during overfitting.
Creoles show this pattern even with unrelated languages, indicating typological uniqueness.
Abstract
We aim to learn language models for Creole languages for which large volumes of data are not readily available, and therefore explore the potential transfer from ancestor languages (the 'Ancestry Transfer Hypothesis'). We find that standard transfer methods do not facilitate ancestry transfer. Surprisingly, different from other non-Creole languages, a very distinct two-phase pattern emerges for Creoles: As our training losses plateau, and language models begin to overfit on their source languages, perplexity on the Creoles drop. We explore if this compression phase can lead to practically useful language models (the 'Ancestry Bottleneck Hypothesis'), but also falsify this. Moreover, we show that Creoles even exhibit this two-phase pattern even when training on random, unrelated languages. Thus Creoles seem to be typological outliers and we speculate whether there is a link between the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Language and cultural evolution
