Learning to Infer from Unlabeled Data: A Semi-supervised Learning   Approach for Robust Natural Language Inference

Mobashir Sadat; Cornelia Caragea

arXiv:2211.02971·cs.CL·November 8, 2022

Learning to Infer from Unlabeled Data: A Semi-supervised Learning Approach for Robust Natural Language Inference

Mobashir Sadat, Cornelia Caragea

PDF

Open Access 1 Repo

TL;DR

This paper introduces a semi-supervised learning framework for Natural Language Inference that leverages a conditional language model to generate hypotheses for unlabeled premises, significantly improving performance in low-resource scenarios.

Contribution

It proposes a novel SSL approach for NLI using BART to generate hypotheses, addressing the challenge of unlabeled sentence pairs in semi-supervised learning.

Findings

01

Substantial performance improvements on four NLI datasets in low-resource settings.

02

Effective utilization of unlabeled data through hypothesis generation with BART.

03

Code release facilitates reproducibility and further research.

Abstract

Natural Language Inference (NLI) or Recognizing Textual Entailment (RTE) aims at predicting the relation between a pair of sentences (premise and hypothesis) as entailment, contradiction or semantic independence. Although deep learning models have shown promising performance for NLI in recent years, they rely on large scale expensive human-annotated datasets. Semi-supervised learning (SSL) is a popular technique for reducing the reliance on human annotation by leveraging unlabeled data for training. However, despite its substantial success on single sentence classification tasks where the challenge in making use of unlabeled data is to assign "good enough" pseudo-labels, for NLI tasks, the nature of unlabeled data is more complex: one of the sentences in the pair (usually the hypothesis) along with the class label are missing from the data and require human annotations, which makes SSL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

msadat3/ssl_for_nli
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Multi-Head Attention · Dense Connections · Adam · Layer Normalization · Byte Pair Encoding · Residual Connection · Dropout