Logiformer: A Two-Branch Graph Transformer Network for Interpretable   Logical Reasoning

Fangzhi Xu; Jun Liu; Qika Lin; Yudai Pan; Lingling Zhang

arXiv:2205.00731·cs.CL·July 11, 2022

Logiformer: A Two-Branch Graph Transformer Network for Interpretable Logical Reasoning

Fangzhi Xu, Jun Liu, Qika Lin, Yudai Pan, Lingling Zhang

PDF

1 Repo

TL;DR

Logiformer introduces a two-branch graph transformer network that models logical and syntax relations in text, improving logical reasoning interpretability and performance on benchmarks.

Contribution

This work presents a novel end-to-end model that effectively captures long-distance dependencies and logical structures using dual graph transformers with dynamic gating.

Findings

01

Outperforms state-of-the-art models on logical reasoning benchmarks.

02

Effectively models long-distance dependencies among logical units.

03

Provides interpretable reasoning aligned with human cognition.

Abstract

Machine reading comprehension has aroused wide concerns, since it explores the potential of model for text understanding. To further equip the machine with the reasoning capability, the challenging task of logical reasoning is proposed. Previous works on logical reasoning have proposed some strategies to extract the logical units from different aspects. However, there still remains a challenge to model the long distance dependency among the logical units. Also, it is demanding to uncover the logical structures of the text and further fuse the discrete logic to the continuous text embedding. To tackle the above issues, we propose an end-to-end model Logiformer which utilizes a two-branch graph transformer network for logical reasoning of text. Firstly, we introduce different extraction strategies to split the text into two sets of logical units, and construct the logical graph and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xufangzhi/logiformer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Multi-Head Attention · Layer Normalization · Residual Connection · Softmax · Label Smoothing · Adam · Position-Wise Feed-Forward Layer