Syntax Representation in Word Embeddings and Neural Networks -- A Survey

Tomasz Limisiewicz; David Mare\v{c}ek

arXiv:2010.01063·cs.CL·October 5, 2020·1 cites

Syntax Representation in Word Embeddings and Neural Networks -- A Survey

Tomasz Limisiewicz, David Mare\v{c}ek

PDF

Open Access

TL;DR

This survey reviews how neural networks in NLP capture syntax implicitly, evaluating various models and representations across monolingual and multilingual tasks to understand their syntactic knowledge.

Contribution

It provides a comprehensive overview of methods for assessing syntactic information in neural network representations across different architectures and languages.

Findings

01

Neural networks encode syntax without explicit supervision.

02

Certain pre-trained models are more effective for syntactic transfer.

03

Multilingual models show varied syntactic encoding capabilities.

Abstract

Neural networks trained on natural language processing tasks capture syntax even though it is not provided as a supervision signal. This indicates that syntactic analysis is essential to the understating of language in artificial intelligence systems. This overview paper covers approaches of evaluating the amount of syntactic information included in the representations of words for different neural network architectures. We mainly summarize re-search on English monolingual data on language modeling tasks and multilingual data for neural machine translation systems and multilingual language models. We describe which pre-trained models and representations of language are best suited for transfer to syntactic tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification