Social Commonsense Reasoning with Multi-Head Knowledge Attention

Debjit Paul; Anette Frank

arXiv:2010.05587·cs.CL·October 13, 2020

Social Commonsense Reasoning with Multi-Head Knowledge Attention

Debjit Paul, Anette Frank

PDF

1 Repo

TL;DR

This paper introduces a multi-head knowledge attention model that enhances social commonsense reasoning by integrating semi-structured inference rules into transformer models, improving performance on reasoning tasks like abductive inference and counterfactual prediction.

Contribution

The paper presents a novel multi-head knowledge attention mechanism that encodes inference rules and improves reasoning capabilities in transformer-based models, especially for social commonsense tasks.

Findings

01

Model outperforms RoBERTa on reasoning tasks

02

Counterfactual reasoning improves abductive inference

03

Robustness validated through knowledge perturbation

Abstract

Social Commonsense Reasoning requires understanding of text, knowledge about social events and their pragmatic implications, as well as commonsense reasoning skills. In this work we propose a novel multi-head knowledge attention model that encodes semi-structured commonsense inference rules and learns to incorporate them in a transformer-based reasoning cell. We assess the model's performance on two tasks that require different reasoning skills: Abductive Natural Language Inference and Counterfactual Invariance Prediction as a new task. We show that our proposed model improves performance over strong state-of-the-art models (i.e., RoBERTa) across both reasoning tasks. Notably we are, to the best of our knowledge, the first to demonstrate that a model that learns to perform counterfactual reasoning helps predicting the best explanation in an abductive reasoning task. We validate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Heidelberg-NLP/MHKA
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.