Contextualized Word Representations for Reading Comprehension

Shimi Salant; Jonathan Berant

arXiv:1712.03609·cs.CL·September 5, 2018

Contextualized Word Representations for Reading Comprehension

Shimi Salant, Jonathan Berant

PDF

1 Repo

TL;DR

This paper demonstrates that incorporating rich contextualized word representations from a large pre-trained language model significantly improves reading comprehension performance, achieving results comparable to state-of-the-art methods on SQuAD.

Contribution

It introduces the use of large pre-trained language models for contextualized word representations in reading comprehension tasks, highlighting the importance of context even when question and document are processed independently.

Findings

01

Significant performance improvements on SQuAD dataset.

02

Contextualized representations outperform traditional embeddings.

03

Model achieves state-of-the-art results with independent question and document processing.

Abstract

Reading a document and extracting an answer to a question about its content has attracted substantial attention recently. While most work has focused on the interaction between the question and the document, in this work we evaluate the importance of context when the question and document are processed independently. We take a standard neural architecture for this task, and show that by providing rich contextualized word representations from a large pre-trained language model as well as allowing the model to choose between context-dependent and context-independent word representations, we can obtain dramatic improvements and reach performance comparable to state-of-the-art on the competitive SQuAD dataset.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shimisalant/CWR
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.