TL;DR
This paper introduces a theoretical framework based on Boolean function sensitivity to measure and predict the complexity of sequence classification tasks, linking sensitivity to model difficulty and success.
Contribution
It extends Boolean function sensitivity theory to sequence tasks, demonstrating how sensitivity influences model learnability and task difficulty in NLP.
Findings
Low-sensitivity functions are easier for models to learn.
Sensitivity is higher on challenging NLP tasks like GLUE.
Sensitivity predicts input difficulty for simple models.
Abstract
We introduce a theoretical framework for understanding and predicting the complexity of sequence classification tasks, using a novel extension of the theory of Boolean function sensitivity. The sensitivity of a function, given a distribution over input sequences, quantifies the number of disjoint subsets of the input sequence that can each be individually changed to change the output. We argue that standard sequence classification methods are biased towards learning low-sensitivity functions, so that tasks requiring high sensitivity are more difficult. To that end, we show analytically that simple lexical classifiers can only express functions of bounded sensitivity, and we show empirically that low-sensitivity functions are easier to learn for LSTMs. We then estimate sensitivity on 15 NLP tasks, finding that sensitivity is higher on challenging tasks collected in GLUE than on simple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
