Transfer of Structural Knowledge from Synthetic Languages

Mikhail Budnikov; Ivan Yamshchikov

arXiv:2505.15769·cs.CL·May 22, 2025

TL;DR

This paper studies transfer learning from several synthetic languages to English. It analyzes the embedding structure of the fine-tuned models, introduces a new synthetic language, and proposes the Tiny-Cloze Benchmark, which is more informative for less powerful models.

Contribution

The paper introduces a new synthetic language that transfers to English better than the languages used in previous research, and presents the Tiny-Cloze Benchmark for evaluating models on simple linguistic tasks.

Findings

01

Fine-tuning on synthetic languages improves English transfer performance.

02

The new synthetic language yields better transfer to English than the synthetic languages used in previous research.

03

The Tiny-Cloze Benchmark provides a more informative evaluation for less powerful models.

Abstract

This work explores transfer learning from several synthetic languages to English. We investigate the structure of the embeddings in the fine-tuned models, the information they contain, and the capabilities of the fine-tuned models on simple linguistic tasks. We also introduce a new synthetic language that leads to better transfer to English than the languages used in previous research. Finally, we introduce the Tiny-Cloze Benchmark, a new synthetic benchmark for natural language understanding that is more informative for less powerful models. We use the Tiny-Cloze Benchmark to evaluate fine-tuned models in several domains, demonstrating that fine-tuning on the new synthetic language allows for better performance on a variety of tasks.

Code & Models

Repositories

msh2481/language_transfer (JAX, official)

Videos

Transfer of Structural Knowledge from Synthetic Languages · Underline

Taxonomy

Topics: Semantic Web and Ontologies · Natural Language Processing Techniques