End-Task Oriented Textual Entailment via Deep Explorations of Inter-Sentence Interactions
Wenpeng Yin, Hinrich Sch\"utze, Dan Roth

TL;DR
This paper introduces DEISTE, a novel deep learning model that enhances textual entailment detection by exploring inter-sentence interactions, significantly improving performance on the SciTail dataset and generalizing well to other entailment tasks.
Contribution
DEISTE employs dynamic convolution and position-aware attentive convolution to better model inter-sentence interactions for entailment detection, advancing the state of the art.
Findings
DEISTE achieves approximately 5% improvement over previous methods.
Pretrained DEISTE generalizes well to RTE-5 dataset.
Model effectively captures word-to-word interactions for entailment.
Abstract
This work deals with SciTail, a natural entailment challenge derived from a multi-choice question answering problem. The premises and hypotheses in SciTail were generated with no awareness of each other, and did not specifically aim at the entailment task. This makes it more challenging than other entailment data sets and more directly useful to the end-task -- question answering. We propose DEISTE (deep explorations of inter-sentence interactions for textual entailment) for this entailment task. Given word-to-word interactions between the premise-hypothesis pair (, ), DEISTE consists of: (i) a parameter-dynamic convolution to make important words in and play a dominant role in learnt representations; and (ii) a position-aware attentive convolution to encode the representation and position information of the aligned word pairs. Experiments show that DEISTE gets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsConvolution
