Linguistic Frameworks Go Toe-to-Toe at Neuro-Symbolic Language Modeling

Jakob Prange; Nathan Schneider; Lingpeng Kong

arXiv:2112.07874·cs.CL·April 6, 2026

Linguistic Frameworks Go Toe-to-Toe at Neuro-Symbolic Language Modeling

Jakob Prange, Nathan Schneider, Lingpeng Kong

PDF

TL;DR

This paper investigates how linguistic graph representations, especially semantic constituency structures, can enhance neural language models, revealing their varying effectiveness across parts of speech and formalism types.

Contribution

It demonstrates that semantic constituency graphs outperform other formalism types in improving language modeling within a neuro-symbolic framework.

Findings

01

Semantic constituency structures are most beneficial for language modeling.

02

Effects of graph formalism vary significantly by part-of-speech.

03

Results suggest promising directions for neuro-symbolic language modeling.

Abstract

We examine the extent to which, in principle, linguistic graph representations can complement and improve neural language modeling. With an ensemble setup consisting of a pretrained Transformer and ground-truth graphs from one of 7 different formalisms, we find that, overall, semantic constituency structures are most useful to language modeling performance -- outpacing syntactic constituency structures as well as syntactic and semantic dependency structures. Further, effects vary greatly depending on part-of-speech class. In sum, our findings point to promising tendencies in neuro-symbolic language modeling and invite future research quantifying the design choices made by different formalisms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.