Pretraining with Artificial Language: Studying Transferable Knowledge in   Language Models

Ryokan Ri; Yoshimasa Tsuruoka

arXiv:2203.10326·cs.CL·March 23, 2022

Pretraining with Artificial Language: Studying Transferable Knowledge in Language Models

Ryokan Ri, Yoshimasa Tsuruoka

PDF

Open Access

TL;DR

This study explores how pretraining on artificial languages with structural features similar to natural language can transfer knowledge to natural language processing tasks, revealing insights into neural encoders' understanding of language structure.

Contribution

The paper introduces a method of pretraining with artificial languages to analyze transferability of structural knowledge in neural encoders, providing new insights into language understanding.

Findings

01

Pretraining on artificial languages with nesting structures transfers some knowledge to natural language tasks.

02

Transferability is linked to the amount of contextual information encoded.

03

Knowledge transferred includes position-aware context dependence.

Abstract

We investigate what kind of structural knowledge learned in neural network encoders is transferable to processing natural language. We design artificial languages with structural properties that mimic natural language, pretrain encoders on the data, and see how much performance the encoder exhibits on downstream tasks in natural language. Our experimental results show that pretraining with an artificial language with a nesting dependency structure provides some knowledge transferable to natural language. A follow-up probing analysis indicates that its success in the transfer is related to the amount of encoded contextual information and what is transferred is the knowledge of position-aware context dependence of language. Our results provide insights into how neural network encoders process human languages and the source of cross-lingual transferability of recent multilingual language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications