First Train to Generate, then Generate to Train: UnitedSynT5 for   Few-Shot NLI

Sourav Banerjee; Anush Mahajan; Ayushi Agarwal; Eishkaran Singh

arXiv:2412.09263·cs.CL·December 16, 2024

First Train to Generate, then Generate to Train: UnitedSynT5 for Few-Shot NLI

Sourav Banerjee, Anush Mahajan, Ayushi Agarwal, Eishkaran Singh

PDF

Open Access

TL;DR

This paper introduces UnitedSynT5, a synthetic data augmentation method for NLI that improves accuracy on multiple datasets by generating and integrating additional premise-hypothesis pairs, surpassing previous state-of-the-art results.

Contribution

The paper presents UnitedSynT5, a novel synthetic data augmentation approach using a T5-based generator to enhance NLI training data and improve model performance.

Findings

01

Achieved 94.7% accuracy on SNLI dataset.

02

Surpassed previous SOTA models in NLI accuracy.

03

Demonstrated effectiveness of synthetic data augmentation in NLI.

Abstract

Natural Language Inference (NLI) tasks require identifying the relationship between sentence pairs, typically classified as entailment, contradiction, or neutrality. While the current state-of-the-art (SOTA) model, Entailment Few-Shot Learning (EFL), achieves a 93.1% accuracy on the Stanford Natural Language Inference (SNLI) dataset, further advancements are constrained by the dataset's limitations. To address this, we propose a novel approach leveraging synthetic data augmentation to enhance dataset diversity and complexity. We present UnitedSynT5, an advanced extension of EFL that leverages a T5-based generator to synthesize additional premise-hypothesis pairs, which are rigorously cleaned and integrated into the training data. These augmented examples are processed within the EFL framework, embedding labels directly into hypotheses for consistency. We train a GTR-T5-XL model on this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNuclear Physics and Applications · Brain Tumor Detection and Classification