Annotating FrameNet via Structure-Conditioned Language Generation

Xinyue Cui; Swabha Swayamdipta

arXiv:2406.04834·cs.CL·June 26, 2024

Annotating FrameNet via Structure-Conditioned Language Generation

Xinyue Cui, Swabha Swayamdipta

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper explores generating semantically structured sentences using language models conditioned on FrameNet, demonstrating high-quality outputs and potential for data augmentation in low-resource scenarios, but with limited benefits in high-resource settings.

Contribution

It introduces a framework for generating frame-semantic annotated sentences conditioned on semantic structures, advancing automatic linguistic annotation techniques.

Findings

01

Generated sentences are highly accepted by humans.

02

Semantic conditioning improves low-resource data augmentation.

03

Limited benefits observed in high-resource settings.

Abstract

Despite the remarkable generative capabilities of language models in producing naturalistic language, their effectiveness on explicit manipulation and generation of linguistic structures remain understudied. In this paper, we investigate the task of generating new sentences preserving a given semantic structure, following the FrameNet formalism. We propose a framework to produce novel frame-semantically annotated sentences following an overgenerate-and-filter approach. Our results show that conditioning on rich, explicit semantic information tends to produce generations with high human acceptance, under both prompting and finetuning. Our generated frame-semantic structured annotations are effective at training data augmentation for frame-semantic role labeling in low-resource settings; however, we do not see benefits under higher resource settings. Our study concludes that while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

X-F-Cui/FrameNet-Conditional-Generation
pytorchOfficial

Videos

Annotating FrameNet via Structure-Conditioned Language Generation· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling