Emergent Linguistic Structures in Neural Networks are Fragile

Emanuele La Malfa; Matthew Wicker; Marta Kwiatkowska

arXiv:2210.17406·cs.LG·June 2, 2023

Emergent Linguistic Structures in Neural Networks are Fragile

Emanuele La Malfa, Matthew Wicker, Marta Kwiatkowska

PDF

Open Access 1 Repo

TL;DR

This paper investigates the robustness of linguistic structures in neural network models, revealing that emergent syntactic representations are fragile and can be disrupted by syntax-preserving perturbations, despite high performance on NLP tasks.

Contribution

It introduces a framework and measures for assessing the robustness of linguistic representations in language models, highlighting their fragility.

Findings

01

Context-free models can be competitive with modern LLMs in syntax tasks.

02

Emergent syntactic representations in neural networks are brittle.

03

Robustness of linguistic structures varies across models and datasets.

Abstract

Large Language Models (LLMs) have been reported to have strong performance on natural language processing tasks. However, performance metrics such as accuracy do not measure the quality of the model in terms of its ability to robustly represent complex linguistic structures. In this paper, focusing on the ability of language models to represent syntax, we propose a framework to assess the consistency and robustness of linguistic representations. To this end, we introduce measures of robustness of neural network models that leverage recent advances in extracting linguistic constructs from LLMs via probing tasks, i.e., simple tasks used to extract meaningful information about a single facet of a language model, such as syntax reconstruction and root identification. Empirically, we study the performance of four LLMs across six different corpora on the proposed robustness measures by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

emanuelelm/emergent-linguistic-structures
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification