What do you learn from context? Probing for sentence structure in contextualized word representations

Ian Tenney; Patrick Xia; Berlin Chen; Alex Wang; Adam Poliak; R. Thomas McCoy; Najoung Kim; Benjamin Van Durme; Samuel R. Bowman; Dipanjan Das; Ellie Pavlick

arXiv:1905.06316·cs.CL·May 16, 2019·139 cites

TL;DR

This paper introduces edge probing, a method for analyzing how contextualized word representations encode sentence structure. The probed models yield strong representations for syntactic phenomena but only small improvements on semantic tasks over a non-contextual baseline.

Contribution

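The paper presents a novel edge probing task design, builds a broad suite of sub-sentence tasks derived from the traditional structured NLP pipeline, and probes word-level contextual representations from four recent models to show how they encode syntactic and semantic information.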

Findings

01

Models excel at syntactic phenomena

02

Only small improvements on semantic tasks over a non-contextual baseline

03

Edge probing shows where contextual representations help (syntax) and where they add little (semantics)

Abstract

Contextualized representation models such as ELMo (Peters et al., 2018a) and BERT (Devlin et al., 2018) have recently achieved state-of-the-art results on a diverse array of downstream NLP tasks. Building on recent token-level probing work, we introduce a novel edge probing task design and construct a broad suite of sub-sentence tasks derived from the traditional structured NLP pipeline. We probe word-level contextual representations from four recent models and investigate how they encode sentence structure across a range of syntactic, semantic, local, and long-range phenomena. We find that existing models trained on language modeling and translation produce strong representations for syntactic phenomena, but only offer comparably small improvements on semantic tasks over a non-contextual baseline.
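The general shape of an edge probe can be sketched as follows: the encoder's per-token vectors are held fixed, one or two spans are pooled into fixed-size vectors, and a small classifier scores labels from those span vectors alone. This is a minimal illustration, not the paper's architecture. Mean pooling, the linear scorer, and the names `pool_span`, `probe_features`, and `probe_scores` are all assumptions introduced here.

```python
from typing import List, Optional, Sequence, Tuple

Vector = List[float]
Span = Tuple[int, int]  # (start, end) token indices, end exclusive


def pool_span(token_vectors: Sequence[Vector], span: Span) -> Vector:
    """Mean-pool the frozen token vectors inside a span (assumed pooling)."""
    start, end = span
    width = end - start
    dim = len(token_vectors[0])
    return [
        sum(token_vectors[i][d] for i in range(start, end)) / width
        for d in range(dim)
    ]


def probe_features(
    token_vectors: Sequence[Vector],
    span1: Span,
    span2: Optional[Span] = None,
) -> Vector:
    """Build the probe input from one span, or two concatenated spans."""
    features = pool_span(token_vectors, span1)
    if span2 is not None:
        features = features + pool_span(token_vectors, span2)
    return features


def probe_scores(
    features: Vector,
    weights: Sequence[Vector],
    biases: Vector,
) -> Vector:
    """Score each label with a linear layer (hypothetical probe head)."""
    return [
        sum(w * x for w, x in zip(row, features)) + b
        for row, b in zip(weights, biases)
    ]


# Toy example: three tokens with 2-dimensional "contextual" vectors.
tokens = [[1.0, 0.0], [3.0, 2.0], [5.0, 4.0]]
features = probe_features(tokens, (0, 1), (1, 3))
```

Because the probe sees only the pooled span vectors, any information about the rest of the sentence has to come from the contextual representations themselves, which is what makes a comparison against a non-contextual baseline informative.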

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Text Readability and Simplification