An Empirical Revisiting of Linguistic Knowledge Fusion in Language Understanding Tasks

Changlong Yu; Tianyi Xiao; Lingpeng Kong; Yangqiu Song; Wilfred Ng

arXiv:2210.13002·cs.CL·October 25, 2022

TL;DR

This paper empirically investigates the role of explicit linguistic priors in language understanding tasks and finds that trivial graph structures can perform as well as linguistically informed ones, which underscores the importance of strong baselines.

Contribution

The study challenges the assumed necessity of linguistic priors by showing trivial graphs can achieve similar performance, urging better baseline design for knowledge fusion methods.

Findings

01

Trivial graphs perform competitively with linguistically informed graphs.

02

Performance gains may stem from feature interactions rather than linguistic priors.

03

Trivial graphs should be used as baselines in future knowledge fusion research.

Abstract

Though linguistic knowledge emerges during large-scale language model pretraining, recent work attempts to explicitly incorporate human-defined linguistic priors into task-specific fine-tuning. Infusing language models with syntactic or semantic knowledge from parsers has shown improvements on many language understanding tasks. To further investigate the effectiveness of structural linguistic priors, we conduct an empirical study that replaces parsed graphs or trees with trivial ones (which carry little linguistic knowledge, e.g., a balanced tree) for tasks in the GLUE benchmark. Encoding with trivial graphs achieves competitive or even better performance in fully-supervised and few-shot settings. This suggests that the gains might not be significantly attributable to explicit linguistic priors but rather to the additional feature interactions brought by fusion layers. Hence we call for attention to using…
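To make the idea of a "trivial" structure concrete, the following is a minimal sketch of how a balanced binary tree could be built over token positions and turned into an adjacency matrix that stands in for a parser's output. It is an illustration only, not the authors' implementation: the function names `balanced_tree_edges` and `to_adjacency`, the midpoint-splitting construction, and the choice of a symmetric adjacency with self-loops are all assumptions made for this example.

```python
from typing import List, Optional, Tuple


def balanced_tree_edges(num_tokens: int) -> List[Tuple[int, int]]:
    """Return (parent, child) edges of a balanced binary tree over token positions.

    Hypothetical helper: the tree is built by recursive midpoint splitting,
    so its shape depends only on sentence length, not on syntax.
    """
    edges: List[Tuple[int, int]] = []

    def build(lo: int, hi: int) -> Optional[int]:
        # Build a subtree over the inclusive span [lo, hi]; return its root.
        if lo > hi:
            return None
        mid = (lo + hi) // 2
        for child in (build(lo, mid - 1), build(mid + 1, hi)):
            if child is not None:
                edges.append((mid, child))
        return mid

    build(0, num_tokens - 1)
    return edges


def to_adjacency(edges: List[Tuple[int, int]], num_tokens: int) -> List[List[int]]:
    """Convert edges to a symmetric 0/1 adjacency matrix with self-loops.

    Assumption: the downstream fusion layer consumes an undirected graph.
    """
    adj = [[0] * num_tokens for _ in range(num_tokens)]
    for i in range(num_tokens):
        adj[i][i] = 1
    for parent, child in edges:
        adj[parent][child] = 1
        adj[child][parent] = 1
    return adj


if __name__ == "__main__":
    tokens = ["the", "cat", "sat", "on", "the", "mat", "."]
    tree = balanced_tree_edges(len(tokens))
    for row in to_adjacency(tree, len(tokens)):
        print(row)
```

Because the tree is determined by sentence length alone, any gain a fusion layer obtains from it cannot come from syntactic information, which is what makes such a graph a useful control.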

Code & Models

Repositories

hkust-knowcomp/revisit-nlu-linguistic-knowledge
PyTorch · Official

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications