A Systematic Study of Compositional Syntactic Transformer Language Models

Yida Zhao; Hao Xve; Xiang Hu; Kewei Tu

arXiv:2506.22978·cs.CL·July 1, 2025

A Systematic Study of Compositional Syntactic Transformer Language Models

Yida Zhao, Hao Xve, Xiang Hu, Kewei Tu

PDF

Open Access 1 Video

TL;DR

This paper systematically evaluates compositional syntactic language models based on constituency parse trees, providing insights and recommendations for their design across various NLP tasks.

Contribution

It introduces a unified framework for compositional SLMs, encompassing existing and new variants, and offers comprehensive empirical evaluation and design recommendations.

Findings

01

Identifies key design choices impacting model performance

02

Provides a unified framework for compositional SLMs

03

Recommends best practices for model design

Abstract

Syntactic language models (SLMs) enhance Transformers by incorporating syntactic biases through the modeling of linearized syntactic parse trees alongside surface sentences. This paper focuses on compositional SLMs that are based on constituency parse trees and contain explicit bottom-up composition of constituent representations. We identify key aspects of design choices in existing compositional SLMs and propose a unified framework encompassing both existing models and novel variants. We conduct a comprehensive empirical evaluation of all the variants in our framework across language modeling, syntactic generalization, summarization, dialogue, and inference efficiency. Based on the experimental results, we make multiple recommendations on the design of compositional SLMs. Our code is released at https://github.com/zhaoyd1/compositional_SLMs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

A Systematic Study of Compositional Syntactic Transformer Language Models· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Computational and Text Analysis Methods