LSTM-based Deep Learning Models for Non-factoid Answer Selection

Ming Tan; Cicero dos Santos; Bing Xiang; Bowen Zhou

arXiv:1511.04108·cs.CL·March 29, 2016·403 cites

LSTM-based Deep Learning Models for Non-factoid Answer Selection

Ming Tan, Cicero dos Santos, Bing Xiang, Bowen Zhou

PDF

Open Access 2 Repos

TL;DR

This paper introduces LSTM-based deep learning models for non-factoid answer selection, combining CNNs and attention mechanisms to improve question-answer matching without manual features.

Contribution

It proposes novel LSTM-based models with CNN and attention mechanisms for answer selection, outperforming existing baselines on multiple datasets.

Findings

01

Models outperform strong baselines on TREC-QA and InsuranceQA datasets.

02

Combining CNNs with biLSTM improves representation quality.

03

Attention mechanisms enhance answer relevance modeling.

Abstract

In this paper, we apply a general deep learning (DL) framework for the answer selection task, which does not depend on manually defined features or linguistic tools. The basic framework is to build the embeddings of questions and answers based on bidirectional long short-term memory (biLSTM) models, and measure their closeness by cosine similarity. We further extend this basic model in two directions. One direction is to define a more composite representation for questions and answers by combining convolutional neural network with the basic framework. The other direction is to utilize a simple but efficient attention mechanism in order to generate the answer representation according to the question context. Several variations of models are provided. The models are examined by two datasets, including TREC-QA and InsuranceQA. Experimental results demonstrate that the proposed models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis