Abductive Reasoning as Self-Supervision for Common Sense Question   Answering

Sathyanarayanan N. Aakur; Sudeep Sarkar

arXiv:1909.03099·cs.CL·September 16, 2019·1 cites

Abductive Reasoning as Self-Supervision for Common Sense Question Answering

Sathyanarayanan N. Aakur, Sudeep Sarkar

PDF

Open Access

TL;DR

This paper introduces an abductive reasoning method based on Pattern Theory to enable self-supervised domain adaptation in question answering models, reducing reliance on costly human annotations while maintaining high performance.

Contribution

The paper presents a novel abductive reasoning approach for self-supervised domain adaptation in question answering, leveraging pseudo-labels to match supervised performance levels.

Findings

01

Self-supervised training retains up to 75% of fully supervised performance.

02

The approach reduces the need for large annotated datasets.

03

Experiments on benchmarks validate the effectiveness of the method.

Abstract

Question answering has seen significant advances in recent times, especially with the introduction of increasingly bigger transformer-based models pre-trained on massive amounts of data. While achieving impressive results on many benchmarks, their performances appear to be proportional to the amount of training data available in the target domain. In this work, we explore the ability of current question-answering models to generalize - to both other domains as well as with restricted training data. We find that large amounts of training data are necessary, both for pre-training as well as fine-tuning to a task, for the models to perform well on the designated task. We introduce a novel abductive reasoning approach based on Grenander's Pattern Theory framework to provide self-supervised domain adaptation cues or "pseudo-labels," which can be used instead of expensive human annotations.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications