Probing for Understanding of English Verb Classes and Alternations in Large Pre-trained Language Models
David K. Yi, James V. Bruno, Jiayu Han, Peter Zukerman, Shane, Steinert-Threlkeld

TL;DR
This paper explores how well large pre-trained language models encode English verb classes and their alternations, revealing that contextual embeddings capture these syntactic properties with high accuracy, especially in upper layers.
Contribution
It demonstrates that PLMs encode verb alternation classes effectively, expanding previous probing studies by including sentence-level tasks and layer-wise analysis.
Findings
Contextual embeddings outperform non-contextual ones.
High accuracy in encoding verb alternation classes.
Upper layers of PLMs perform better in probing tasks.
Abstract
We investigate the extent to which verb alternation classes, as described by Levin (1993), are encoded in the embeddings of Large Pre-trained Language Models (PLMs) such as BERT, RoBERTa, ELECTRA, and DeBERTa using selectively constructed diagnostic classifiers for word and sentence-level prediction tasks. We follow and expand upon the experiments of Kann et al. (2019), which aim to probe whether static embeddings encode frame-selectional properties of verbs. At both the word and sentence level, we find that contextual embeddings from PLMs not only outperform non-contextual embeddings, but achieve astonishingly high accuracies on tasks across most alternation classes. Additionally, we find evidence that the middle-to-upper layers of PLMs achieve better performance on average than the lower layers across all probing tasks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
MethodsAttention Is All You Need · Linear Layer · Weight Decay · How do I file a dispute with Expedia?*DisputeFastService · Layer Normalization · Multi-Head Attention · Attention Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · BERT · WordPiece
