Neural Conversational QA: Learning to Reason v.s. Exploiting Patterns

Nikhil Verma; Abhishek Sharma; Dhiraj Madan; Danish; Contractor; Harshit Kumar; Sachindra Joshi

arXiv:1909.03759·cs.CL·October 12, 2020

Neural Conversational QA: Learning to Reason v.s. Exploiting Patterns

Nikhil Verma, Abhishek Sharma, Dhiraj Madan, Danish, Contractor, Harshit Kumar, Sachindra Joshi

PDF

2 Repos 1 Datasets

TL;DR

This paper investigates how neural conversational QA models learn and exploit dataset patterns, revealing their reliance on spurious clues, and introduces a modified dataset to improve model reasoning capabilities.

Contribution

The paper identifies spurious pattern exploitation in neural QA models and provides a modified dataset to reduce these biases, enhancing reasoning performance.

Findings

01

Neural models learn spurious dataset patterns.

02

Heuristic programs exploiting these patterns perform comparably to neural models.

03

Modified dataset reduces reliance on spurious clues.

Abstract

Neural Conversational QA tasks like ShARC require systems to answer questions based on the contents of a given passage. On studying recent state-of-the-art models on the ShARCQA task, we found indications that the models learn spurious clues/patterns in the dataset. Furthermore, we show that a heuristic-based program designed to exploit these patterns can have performance comparable to that of the neural models. In this paper we share our findings about four types of patterns found in the ShARC corpus and describe how neural models exploit them. Motivated by the aforementioned findings, we create and share a modified dataset that has fewer spurious patterns, consequently allowing models to learn better.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

nikhilweee/sharc_modified
dataset· 43 dl
43 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.