First Train to Generate, then Generate to Train: UnitedSynT5 for Few-Shot NLI
Sourav Banerjee, Anush Mahajan, Ayushi Agarwal, Eishkaran Singh

TL;DR
This paper introduces UnitedSynT5, a synthetic data augmentation method for NLI that improves accuracy on multiple datasets by generating and integrating additional premise-hypothesis pairs, surpassing previous state-of-the-art results.
Contribution
The paper presents UnitedSynT5, a novel synthetic data augmentation approach using a T5-based generator to enhance NLI training data and improve model performance.
Findings
Achieved 94.7% accuracy on SNLI dataset.
Surpassed previous SOTA models in NLI accuracy.
Demonstrated effectiveness of synthetic data augmentation in NLI.
Abstract
Natural Language Inference (NLI) tasks require identifying the relationship between sentence pairs, typically classified as entailment, contradiction, or neutrality. While the current state-of-the-art (SOTA) model, Entailment Few-Shot Learning (EFL), achieves a 93.1% accuracy on the Stanford Natural Language Inference (SNLI) dataset, further advancements are constrained by the dataset's limitations. To address this, we propose a novel approach leveraging synthetic data augmentation to enhance dataset diversity and complexity. We present UnitedSynT5, an advanced extension of EFL that leverages a T5-based generator to synthesize additional premise-hypothesis pairs, which are rigorously cleaned and integrated into the training data. These augmented examples are processed within the EFL framework, embedding labels directly into hypotheses for consistency. We train a GTR-T5-XL model on this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNuclear Physics and Applications · Brain Tumor Detection and Classification
