Code-switched Language Models Using Dual RNNs and Same-Source   Pretraining

Saurabh Garg; Tanmay Parekh; Preethi Jyothi

arXiv:1809.01962·cs.CL·September 7, 2018

Code-switched Language Models Using Dual RNNs and Same-Source Pretraining

Saurabh Garg, Tanmay Parekh, Preethi Jyothi

PDF

TL;DR

This paper introduces dual RNN-based language models with same-source pretraining to improve code-switched language modeling, demonstrating significant perplexity reductions on Mandarin-English data.

Contribution

It presents a novel dual RNN architecture and a pretraining method using synthetic data, advancing code-switched language modeling techniques.

Findings

01

Perplexity reduced significantly on Mandarin-English task

02

Dual RNN units effectively model each language in code-switching

03

Pretraining with synthetic data improves language model performance

Abstract

This work focuses on building language models (LMs) for code-switched text. We propose two techniques that significantly improve these LMs: 1) A novel recurrent neural network unit with dual components that focus on each language in the code-switched text separately 2) Pretraining the LM using synthetic text from a generative model estimated using the training data. We demonstrate the effectiveness of our proposed techniques by reporting perplexities on a Mandarin-English task and derive significant reductions in perplexity.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.