Sensitivity as a Complexity Measure for Sequence Classification Tasks

Michael Hahn; Dan Jurafsky; Richard Futrell

arXiv:2104.10343·cs.CL·April 22, 2021

Sensitivity as a Complexity Measure for Sequence Classification Tasks

Michael Hahn, Dan Jurafsky, Richard Futrell

PDF

1 Repo

TL;DR

This paper introduces a theoretical framework based on Boolean function sensitivity to measure and predict the complexity of sequence classification tasks, linking sensitivity to model difficulty and success.

Contribution

It extends Boolean function sensitivity theory to sequence tasks, demonstrating how sensitivity influences model learnability and task difficulty in NLP.

Findings

01

Low-sensitivity functions are easier for models to learn.

02

Sensitivity is higher on challenging NLP tasks like GLUE.

03

Sensitivity predicts input difficulty for simple models.

Abstract

We introduce a theoretical framework for understanding and predicting the complexity of sequence classification tasks, using a novel extension of the theory of Boolean function sensitivity. The sensitivity of a function, given a distribution over input sequences, quantifies the number of disjoint subsets of the input sequence that can each be individually changed to change the output. We argue that standard sequence classification methods are biased towards learning low-sensitivity functions, so that tasks requiring high sensitivity are more difficult. To that end, we show analytically that simple lexical classifiers can only express functions of bounded sensitivity, and we show empirically that low-sensitivity functions are easier to learn for LSTMs. We then estimate sensitivity on 15 NLP tasks, finding that sensitivity is higher on challenging tasks collected in GLUE than on simple…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

m-hahn/sensitivity
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.