Creating Causal Embeddings for Question Answering with Minimal   Supervision

Rebecca Sharp; Mihai Surdeanu; Peter Jansen; Peter Clark; and Michael; Hammond

arXiv:1609.08097·cs.CL·September 27, 2016

Creating Causal Embeddings for Question Answering with Minimal Supervision

Rebecca Sharp, Mihai Surdeanu, Peter Jansen, Peter Clark, and Michael, Hammond

PDF

TL;DR

This paper introduces a method for creating task-specific causal embeddings from minimal supervision, improving question answering performance by explicitly modeling causality.

Contribution

It presents a novel approach to generate causal embeddings using bootstrapped cause-effect pairs and incorporates them into QA reranking, with significant performance gains.

Findings

01

Causal embeddings improve QA accuracy.

02

Explicit causality modeling outperforms lexical similarity.

03

Method achieves 37.3% P@1 in causal QA.

Abstract

A common model for question answering (QA) is that a good answer is one that is closely related to the question, where relatedness is often determined using general-purpose lexical models such as word embeddings. We argue that a better approach is to look for answers that are related to the question in a relevant way, according to the information need of the question, which may be determined through task-specific embeddings. With causality as a use case, we implement this insight in three steps. First, we generate causal embeddings cost-effectively by bootstrapping cause-effect pairs extracted from free text using a small set of seed patterns. Second, we train dedicated embeddings over this data, by using task-specific contexts, i.e., the context of a cause is its effect. Finally, we extend a state-of-the-art reranking approach for QA to incorporate these causal embeddings. We evaluate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.