Learning Language Representations with Logical Inductive Bias

Jianshu Chen

arXiv:2302.09458·cs.CL·February 21, 2023

Learning Language Representations with Logical Inductive Bias

Jianshu Chen

PDF

Open Access 1 Video

TL;DR

This paper introduces FOLNet, a neural architecture embedding logical inductive bias into language models, enhancing reasoning and transfer capabilities beyond traditional transformer models.

Contribution

We develop FOLNet, a novel neural network encoding logical inductive bias with learnable Horn clauses, improving reasoning and transfer in language representations.

Findings

01

FOLNet outperforms existing transformer-based models on language understanding tasks.

02

Neural logic operators in FOLNet can compose self-attention modules.

03

Logical inductive bias enhances transfer capabilities of language models.

Abstract

Transformer architectures have achieved great success in solving natural language tasks, which learn strong language representations from large-scale unlabeled texts. In this paper, we seek to go further beyond and explore a new logical inductive bias for better language representation learning. Logic reasoning is known as a formal methodology to reach answers from given knowledge and facts. Inspired by such a view, we develop a novel neural architecture named FOLNet (First-Order Logic Network), to encode this new inductive bias. We construct a set of neural logic operators as learnable Horn clauses, which are further forward-chained into a fully differentiable neural architecture (FOLNet). Interestingly, we find that the self-attention module in transformers can be composed by two of our neural logic operators, which probably explains their strong reasoning performance. Our proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Learning Language Representations with Logical Inductive Bias· slideslive

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification