mcBERT: Momentum Contrastive Learning with BERT for Zero-Shot Slot   Filling

Seong-Hwan Heo; WonKee Lee; Jong-Hyeok Lee

arXiv:2203.12940·cs.CL·June 29, 2022

mcBERT: Momentum Contrastive Learning with BERT for Zero-Shot Slot Filling

Seong-Hwan Heo, WonKee Lee, Jong-Hyeok Lee

PDF

Open Access

TL;DR

mcBERT introduces a momentum contrastive learning approach with BERT for zero-shot slot filling, achieving state-of-the-art results by learning more generalized and reliable representations for unseen domains.

Contribution

The paper proposes mcBERT, a novel zero-shot slot filling model that leverages momentum contrastive learning with BERT to improve generalization and performance.

Findings

01

Outperforms previous models on SNIPS benchmark

02

Achieves new state-of-the-art results

03

Each component of mcBERT contributes to performance gains

Abstract

Zero-shot slot filling has received considerable attention to cope with the problem of limited available data for the target domain. One of the important factors in zero-shot learning is to make the model learn generalized and reliable representations. For this purpose, we present mcBERT, which stands for momentum contrastive learning with BERT, to develop a robust zero-shot slot filling model. mcBERT uses BERT to initialize the two encoders, the query encoder and key encoder, and is trained by applying momentum contrastive learning. Our experimental results on the SNIPS benchmark show that mcBERT substantially outperforms the previous models, recording a new state-of-the-art. Besides, we also show that each component composing mcBERT contributes to the performance improvement.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Geophysical Methods and Applications

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Contrastive Learning · Dropout · Layer Normalization · Adam · Attention Dropout · Residual Connection