Active entailment encoding for explanation tree construction using parsimonious generation of hard negatives
Alex Bogatu, Zili Zhou, D\'onal Landers, Andr\'e Freitas

TL;DR
This paper introduces an active fine-tuning approach for entailment models that significantly improves the efficiency of constructing explanation trees in open-domain question answering by using hard negative sampling.
Contribution
It proposes a novel active premise selection method with iterative fine-tuning of Transformer models to assist in building explanation trees more efficiently.
Findings
Up to 20% improvement in premise selection accuracy.
Effective use of hard negative sampling for entailment encoding.
Enhanced support for multi-level explanation tree construction.
Abstract
Entailment trees have been proposed to simulate the human reasoning process of explanation generation in the context of open--domain textual question answering. However, in practice, manually constructing these explanation trees proves a laborious process that requires active human involvement. Given the complexity of capturing the line of reasoning from question to the answer or from claim to premises, the issue arises of how to assist the user in efficiently constructing multi--level entailment trees given a large set of available facts. In this paper, we frame the construction of entailment trees as a sequence of active premise selection steps, i.e., for each intermediate node in an explanation tree, the expert needs to annotate positive and negative examples of premise facts from a large candidate list. We then iteratively fine--tune pre--trained Transformer models with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Position-Wise Feed-Forward Layer · Layer Normalization · Adam · Byte Pair Encoding · Label Smoothing · Residual Connection
