Analysing Neural Language Models: Contextual Decomposition Reveals   Default Reasoning in Number and Gender Assignment

Jaap Jumelet; Willem Zuidema; Dieuwke Hupkes

arXiv:1909.08975·cs.CL·September 20, 2019

Analysing Neural Language Models: Contextual Decomposition Reveals Default Reasoning in Number and Gender Assignment

Jaap Jumelet, Willem Zuidema, Dieuwke Hupkes

PDF

1 Repo

TL;DR

This paper introduces a generalized method called GCD to analyze neural language models, revealing that they rely heavily on default reasoning rather than purely syntactic or semantic cues for tasks like agreement and co-reference.

Contribution

The paper presents GCD, a novel technique to dissect model decisions, uncovering the reliance on default reasoning in neural language models.

Findings

01

Models depend on default reasoning for syntactic agreement.

02

GCD effectively distinguishes semantic, syntactic, and bias influences.

03

Neural models show strong reliance on heuristics rather than true understanding.

Abstract

Extensive research has recently shown that recurrent neural language models are able to process a wide range of grammatical phenomena. How these models are able to perform these remarkable feats so well, however, is still an open question. To gain more insight into what information LSTMs base their decisions on, we propose a generalisation of Contextual Decomposition (GCD). In particular, this setup enables us to accurately distil which part of a prediction stems from semantic heuristics, which part truly emanates from syntactic cues and which part arise from the model biases themselves instead. We investigate this technique on tasks pertaining to syntactic agreement and co-reference resolution and discover that the model strongly relies on a default reasoning effect to perform these tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

i-machine-think/diagnnose
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.