A Fully Attention-Based Information Retriever

Alvaro Henrique Chaim Correia; Jorge Luiz Moreira Silva; Thiago de; Castro Martins; Fabio Gagliardi Cozman

arXiv:1810.09580·cs.CL·October 24, 2018

A Fully Attention-Based Information Retriever

Alvaro Henrique Chaim Correia, Jorge Luiz Moreira Silva, Thiago de, Castro Martins, Fabio Gagliardi Cozman

PDF

1 Repo

TL;DR

This paper introduces FABIR, a fully attention-based neural network for question-answering that achieves competitive results on SQuAD with fewer parameters and faster processing compared to traditional RNN-based models.

Contribution

The paper presents a novel fully attention-based architecture for information retrieval, replacing recurrent networks with parallelizable attention mechanisms.

Findings

01

FABIR achieves competitive SQuAD scores.

02

FABIR has fewer parameters than RNN-based models.

03

FABIR is faster in training and inference.

Abstract

Recurrent neural networks are now the state-of-the-art in natural language processing because they can build rich contextual representations and process texts of arbitrary length. However, recent developments on attention mechanisms have equipped feedforward networks with similar capabilities, hence enabling faster computations due to the increase in the number of operations that can be parallelized. We explore this new type of architecture in the domain of question-answering and propose a novel approach that we call Fully Attention Based Information Retriever (FABIR). We show that FABIR achieves competitive results in the Stanford Question Answering Dataset (SQuAD) while having fewer parameters and being faster at both learning and inference than rival methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AlCorreia/FABIR
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.