Retrieval Augmentation for Deep Neural Networks

Rita Parada Ramos; Patr\'icia Pereira; Helena Moniz; Joao Paulo; Carvalho; Bruno Martins

arXiv:2102.13030·cs.CL·April 27, 2021

Retrieval Augmentation for Deep Neural Networks

Rita Parada Ramos, Patr\'icia Pereira, Helena Moniz, Joao Paulo, Carvalho, Bruno Martins

PDF

1 Repo

TL;DR

This paper introduces a retrieval augmentation method that leverages nearest training examples to improve deep neural network predictions during training and testing, demonstrating effectiveness in image captioning and sentiment analysis tasks.

Contribution

It proposes a novel retrieval-based augmentation technique that uses nearest training examples to enhance neural network performance during both training and inference.

Findings

01

Improved accuracy in image captioning on Flickr8 dataset.

02

Enhanced sentiment analysis results on IMDB dataset.

03

Code implementation is publicly available for reproducibility.

Abstract

Deep neural networks have achieved state-of-the-art results in various vision and/or language tasks. Despite the use of large training datasets, most models are trained by iterating over single input-output pairs, discarding the remaining examples for the current prediction. In this work, we actively exploit the training data, using the information from nearest training examples to aid the prediction both during training and testing. Specifically, our approach uses the target of the most similar training example to initialize the memory state of an LSTM model, or to guide attention mechanisms. We apply this approach to image captioning and sentiment analysis, respectively through image and text retrieval. Results confirm the effectiveness of the proposed approach for the two tasks, on the widely used Flickr8 and IMDB datasets. Our code is publicly available at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

RitaRamo/retrieval-augmentation-nn
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory