Learning to Infer from Unlabeled Data: A Semi-supervised Learning Approach for Robust Natural Language Inference
Mobashir Sadat, Cornelia Caragea

TL;DR
This paper introduces a semi-supervised learning framework for Natural Language Inference that leverages a conditional language model to generate hypotheses for unlabeled premises, significantly improving performance in low-resource scenarios.
Contribution
It proposes a novel SSL approach for NLI using BART to generate hypotheses, addressing the challenge of unlabeled sentence pairs in semi-supervised learning.
Findings
Substantial performance improvements on four NLI datasets in low-resource settings.
Effective utilization of unlabeled data through hypothesis generation with BART.
Code release facilitates reproducibility and further research.
Abstract
Natural Language Inference (NLI) or Recognizing Textual Entailment (RTE) aims at predicting the relation between a pair of sentences (premise and hypothesis) as entailment, contradiction or semantic independence. Although deep learning models have shown promising performance for NLI in recent years, they rely on large scale expensive human-annotated datasets. Semi-supervised learning (SSL) is a popular technique for reducing the reliance on human annotation by leveraging unlabeled data for training. However, despite its substantial success on single sentence classification tasks where the challenge in making use of unlabeled data is to assign "good enough" pseudo-labels, for NLI tasks, the nature of unlabeled data is more complex: one of the sentences in the pair (usually the hypothesis) along with the class label are missing from the data and require human annotations, which makes SSL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Multi-Head Attention · Dense Connections · Adam · Layer Normalization · Byte Pair Encoding · Residual Connection · Dropout
