Switch Point biased Self-Training: Re-purposing Pretrained Models for Code-Switching
Parul Chopra, Sai Krishna Rallabandi, Alan W Black, Khyathi Raghavi, Chandu

TL;DR
This paper introduces a switch-point biased self-training method to improve multilingual models' performance on code-switching tasks by leveraging unannotated data, addressing the challenge of intra-sentence language mixing.
Contribution
It proposes a novel self-training approach that leverages switch-point bias to enhance pretrained models for code-switching tasks, with comprehensive benchmarking and analysis.
Findings
Improved POS and NER performance on code-switching data.
Reduced performance gap at switch points.
Retained overall model performance across language pairs.
Abstract
Code-switching (CS), a ubiquitous phenomenon due to the ease of communication it offers in multilingual communities still remains an understudied problem in language processing. The primary reasons behind this are: (1) minimal efforts in leveraging large pretrained multilingual models, and (2) the lack of annotated data. The distinguishing case of low performance of multilingual models in CS is the intra-sentence mixing of languages leading to switch points. We first benchmark two sequence labeling tasks -- POS and NER on 4 different language pairs with a suite of pretrained models to identify the problems and select the best performing model, char-BERT, among them (addressing (1)). We then propose a self training method to repurpose the existing pretrained models using a switch-point bias by leveraging unannotated data (addressing (2)). We finally demonstrate that our approach performs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
