Learning Semantic Structure through First-Order-Logic Translation

Akshay Chaturvedi; Nicholas Asher

arXiv:2410.03203·cs.CL·October 7, 2024

Learning Semantic Structure through First-Order-Logic Translation

Akshay Chaturvedi, Nicholas Asher

PDF

Open Access 1 Video

TL;DR

This paper investigates how transformer-based language models can learn predicate argument structures, comparing question answering and first-order logic translation tasks, with a focus on their generalization abilities and the effectiveness of finetuning versus prompting.

Contribution

It demonstrates that finetuning large language models on FOL translation tasks enhances their ability to understand predicate argument structures compared to other methods.

Findings

01

FOL translation improves predicate-argument extraction in LLMs.

02

Finetuning yields better generalization than prompting.

03

LLMs excel at FOL translation over Q/A for semantic structure learning.

Abstract

In this paper, we study whether transformer-based language models can extract predicate argument structure from simple sentences. We firstly show that language models sometimes confuse which predicates apply to which objects. To mitigate this, we explore two tasks: question answering (Q/A), and first order logic (FOL) translation, and two regimes, prompting and finetuning. In FOL translation, we finetune several large language models on synthetic datasets designed to gauge their generalization abilities. For Q/A, we finetune encoder models like BERT and RoBERTa and use prompting for LLMs. The results show that FOL translation for LLMs is better suited to learn predicate argument structure.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Learning Semantic Structure through First-Order-Logic Translation· underline

Taxonomy

TopicsNatural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Softmax · Multi-Head Attention · Layer Normalization · Dense Connections · Adam · WordPiece · Attention Dropout