TL;DR
This paper introduces CLTS, a simple yet effective cross-lingual text classification method that uses minimal translation resources to transfer task-specific seed words, outperforming existing approaches across 18 languages.
Contribution
CLTS is a novel teacher-student framework that leverages limited cross-lingual resources to generate weak supervision, improving classification performance in low-resource languages.
Findings
Transferring 20 seed words yields superior results to state-of-the-art methods.
Using a monolingual BERT student boosts accuracy by up to 12%.
CLTS outperforms expensive multilingual models in diverse languages.
Abstract
Cross-lingual text classification alleviates the need for manually labeled documents in a target language by leveraging labeled documents from other languages. Existing approaches for transferring supervision across languages require expensive cross-lingual resources, such as parallel corpora, while less expensive cross-lingual representation learning approaches train classifiers without target labeled documents. In this work, we propose a cross-lingual teacher-student method, CLTS, that generates "weak" supervision in the target language using minimal cross-lingual resources, in the form of a small number of word translations. Given a limited translation budget, CLTS extracts and transfers only the most important task-specific seed words across languages and initializes a teacher classifier based on the translated seed words. Then, CLTS iteratively trains a more powerful student that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Dense Connections · Layer Normalization · WordPiece · Multi-Head Attention · Dropout · Linear Warmup With Linear Decay · Attention Dropout · Logistic Regression · Weight Decay
