Active entailment encoding for explanation tree construction using   parsimonious generation of hard negatives

Alex Bogatu; Zili Zhou; D\'onal Landers; Andr\'e Freitas

arXiv:2208.01376·cs.CL·August 3, 2022·1 cites

Active entailment encoding for explanation tree construction using parsimonious generation of hard negatives

Alex Bogatu, Zili Zhou, D\'onal Landers, Andr\'e Freitas

PDF

Open Access

TL;DR

This paper introduces an active fine-tuning approach for entailment models that significantly improves the efficiency of constructing explanation trees in open-domain question answering by using hard negative sampling.

Contribution

It proposes a novel active premise selection method with iterative fine-tuning of Transformer models to assist in building explanation trees more efficiently.

Findings

01

Up to 20% improvement in premise selection accuracy.

02

Effective use of hard negative sampling for entailment encoding.

03

Enhanced support for multi-level explanation tree construction.

Abstract

Entailment trees have been proposed to simulate the human reasoning process of explanation generation in the context of open--domain textual question answering. However, in practice, manually constructing these explanation trees proves a laborious process that requires active human involvement. Given the complexity of capturing the line of reasoning from question to the answer or from claim to premises, the issue arises of how to assist the user in efficiently constructing multi--level entailment trees given a large set of available facts. In this paper, we frame the construction of entailment trees as a sequence of active premise selection steps, i.e., for each intermediate node in an explanation tree, the expert needs to annotate positive and negative examples of premise facts from a large candidate list. We then iteratively fine--tune pre--trained Transformer models with the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Position-Wise Feed-Forward Layer · Layer Normalization · Adam · Byte Pair Encoding · Label Smoothing · Residual Connection