MirrorWiC: On Eliciting Word-in-Context Representations from Pretrained Language Models
Qianchu Liu, Fangyu Liu, Nigel Collier, Anna Korhonen, Ivan Vuli\'c

TL;DR
MirrorWiC is an unsupervised fine-tuning method that enhances word-in-context representations in pretrained language models using contrastive learning on raw Wikipedia data, achieving competitive results without labeled sense data.
Contribution
It introduces a simple, fully unsupervised WiC fine-tuning approach that improves context-aware word representations across multiple languages and benchmarks.
Findings
Significant performance improvements over baseline PLMs.
Achieves results comparable to supervised models on some benchmarks.
Effective across monolingual, multilingual, and cross-lingual settings.
Abstract
Recent work indicated that pretrained language models (PLMs) such as BERT and RoBERTa can be transformed into effective sentence and word encoders even via simple self-supervised techniques. Inspired by this line of work, in this paper we propose a fully unsupervised approach to improving word-in-context (WiC) representations in PLMs, achieved via a simple and efficient WiC-targeted fine-tuning procedure: MirrorWiC. The proposed method leverages only raw texts sampled from Wikipedia, assuming no sense-annotated data, and learns context-aware word representations within a standard contrastive learning setup. We experiment with a series of standard and comprehensive WiC benchmarks across multiple languages. Our proposed fully unsupervised MirrorWiC models obtain substantial gains over off-the-shelf PLMs across all monolingual, multilingual and cross-lingual setups. Moreover, on some…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Contrastive Learning · Mirror-BERT · Weight Decay · WordPiece · Layer Normalization · Dense Connections · Attention Dropout
