A Comprehensive Comparison of Neural Networks as Cognitive Models of   Inflection

Adam Wiemerslage; Shiran Dudy; Katharina Kann

arXiv:2210.12321·cs.CL·October 25, 2022·1 cites

A Comprehensive Comparison of Neural Networks as Cognitive Models of Inflection

Adam Wiemerslage, Shiran Dudy, Katharina Kann

PDF

Open Access

TL;DR

This paper compares various neural network architectures to human judgments on morphological inflection tasks, finding Transformers align more closely with human behavior than LSTMs, challenging assumptions about accuracy and human-likeness.

Contribution

It provides a comprehensive comparison of neural network models as cognitive accounts of inflection, highlighting the Transformer’s superior alignment with human judgments over LSTMs.

Findings

01

Transformers better match human judgments than LSTMs.

02

LSTM features improving accuracy do not always enhance human-likeness.

03

Neural network architecture influences cognitive plausibility in inflection tasks.

Abstract

Neural networks have long been at the center of a debate around the cognitive mechanism by which humans process inflectional morphology. This debate has gravitated into NLP by way of the question: Are neural networks a feasible account for human behavior in morphological inflection? We address that question by measuring the correlation between human judgments and neural network probabilities for unknown word inflections. We test a larger range of architectures than previously studied on two important tasks for the cognitive processing debate: English past tense, and German number inflection. We find evidence that the Transformer may be a better account of human behavior than LSTMs on these datasets, and that LSTM features known to increase inflection accuracy do not always result in more human-like behavior.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Test · Linear Layer · Dense Connections · Softmax · Position-Wise Feed-Forward Layer · Adam · Tanh Activation · Label Smoothing