Neural Network Acceptability Judgments
Alex Warstadt, Amanpreet Singh, Samuel R. Bowman

TL;DR
This paper evaluates neural networks' ability to judge sentence grammaticality using a new dataset, showing they outperform previous models but still fall short of human performance across various grammatical phenomena.
Contribution
Introduces the CoLA dataset for acceptability judgments and benchmarks neural network models, highlighting their strengths and limitations.
Findings
Models outperform previous unsupervised approaches on CoLA
Neural networks learn some grammatical generalizations
Models perform below human level on complex constructions
Abstract
This paper investigates the ability of artificial neural networks to judge the grammatical acceptability of a sentence, with the goal of testing their linguistic competence. We introduce the Corpus of Linguistic Acceptability (CoLA), a set of 10,657 English sentences labeled as grammatical or ungrammatical from published linguistics literature. As baselines, we train several recurrent neural network models on acceptability classification, and find that our models outperform unsupervised models by Lau et al (2016) on CoLA. Error-analysis on specific grammatical phenomena reveals that both Lau et al.'s models and ours learn systematic generalizations like subject-verb-object order. However, all models we test perform far below human level on a wide range of grammatical constructions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗google-t5/t5-smallmodel· 1.9M dl· ♡ 5381.9M dl♡ 538
- 🤗google-t5/t5-largemodel· 451k dl· ♡ 253451k dl♡ 253
- 🤗google-t5/t5-11bmodel· 22k dl· ♡ 6922k dl♡ 69
- 🤗google-t5/t5-3bmodel· 428k dl· ♡ 52428k dl♡ 52
- 🤗google-t5/t5-basemodel· 1.8M dl· ♡ 7701.8M dl♡ 770
- 🤗Kamrani/t5-largemodel· 6 dl6 dl
- 🤗qiaoyi/Comment_Summarization4DesignTutormodel· 11 dl11 dl
- 🤗ybelkada/t5-11b-shardedmodel· 11 dl· ♡ 211 dl♡ 2
- 🤗michellehbn/brrrrmodel· ♡ 1♡ 1
- 🤗BrainStormersHakton/question-gen-T5-basemodel· 3 dl3 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
