Training Data is More Valuable than You Think: A Simple and Effective Method by Retrieving from Training Data

Shuohang Wang; Yichong Xu; Yuwei Fang; Yang Liu; Siqi Sun; Ruochen Xu; Chenguang Zhu; Michael Zeng

arXiv:2203.08773 · cs.CL · March 17, 2022

TL;DR

Retrieving the training instances most similar to an input and concatenating them with that input significantly improves performance on NLP tasks, reaching state-of-the-art results with a simple, low-cost method.

Contribution

Introduces REINA (REtrieving from the traINing datA), a simple retrieval-based approach that improves NLP models by retrieving from their own training data, yielding significant gains across multiple NLG and NLU tasks.

Findings

01. Significant performance improvements on summarization, machine translation, language modeling, and question answering tasks.

02. State-of-the-art results on XSum, BigPatent, and CommonsenseQA.

03. A simple retrieval method outperforms many existing approaches.

Abstract

Retrieval-based methods have been shown to be effective in NLP tasks by introducing external knowledge. However, indexing and retrieving from large-scale corpora bring considerable computational cost. Surprisingly, we found that REtrieving from the traINing datA (REINA) alone can lead to significant gains on multiple NLG and NLU tasks. We retrieve the labeled training instances most similar to the input text and then concatenate them with the input to feed into the model to generate the output. Experimental results show that this simple method can achieve significantly better performance on a variety of NLU and NLG tasks, including summarization, machine translation, language modeling, and question answering. For instance, our proposed method achieved state-of-the-art results on XSum, BigPatent, and CommonsenseQA. Our code is released at https://github.com/microsoft/REINA.
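
The retrieve-then-concatenate step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the released REINA code: the Jaccard token-overlap scorer is a stand-in for whatever retriever is actually used, and the names `retrieve` and `build_input`, the `" [SEP] "` separator, the `"source => label"` layout, and the `exclude_index` option (which keeps a training example from retrieving itself) are all assumptions made for the example.

```python
from typing import List, Optional, Tuple

Example = Tuple[str, str]  # (input text, label text)


def tokenize(text: str) -> List[str]:
    # Whitespace tokenization; a real system would use a proper tokenizer.
    return text.lower().split()


def jaccard(a: List[str], b: List[str]) -> float:
    # Token-set overlap, used here as a stand-in similarity (assumption).
    sa, sb = set(a), set(b)
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def retrieve(query: str, train: List[Example], k: int = 2,
             exclude_index: Optional[int] = None) -> List[Example]:
    # Score every labeled training instance against the query and keep the top k.
    q = tokenize(query)
    scored = []
    for i, (src, _label) in enumerate(train):
        if i == exclude_index:
            continue  # skip the query's own training instance
        scored.append((jaccard(q, tokenize(src)), i))
    # Highest score first; ties broken by original order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [train[i] for _score, i in scored[:k]]


def build_input(query: str, retrieved: List[Example], sep: str = " [SEP] ") -> str:
    # Concatenate the query with the retrieved (input, label) pairs.
    parts = [query] + [f"{src} => {label}" for src, label in retrieved]
    return sep.join(parts)


# Toy training set, invented for illustration only.
train = [
    ("the cat sat on the mat", "cat sits"),
    ("stocks fell sharply on monday", "markets drop"),
    ("the dog sat on the rug", "dog sits"),
]
query = "a cat sat on a mat"
augmented = build_input(query, retrieve(query, train, k=2))
print(augmented)
```

The augmented string would then be fed to the model in place of the raw input. When building augmented inputs for the training set itself, passing `exclude_index` avoids handing the model its own gold label.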

Code & Models

Repositories

microsoft/reina (PyTorch, official)