A Deep Architecture for Semantic Matching with Multiple Positional Sentence Representations
Shengxian Wan, Yanyan Lan, Jiafeng Guo, Jun Xu, Liang Pang, and Xueqi, Cheng

TL;DR
This paper introduces a deep neural architecture that matches sentences by leveraging multiple positional representations generated by Bi-LSTM, effectively capturing contextualized local information for improved performance in tasks like question answering.
Contribution
The paper proposes a novel deep model that uses multiple positional sentence representations with Bi-LSTM for better sentence matching, surpassing existing single or multi-granularity approaches.
Findings
Outperforms existing models on question answering tasks
Effectively captures contextualized local information
Demonstrates superiority in sentence completion
Abstract
Matching natural language sentences is central for many applications such as information retrieval and question answering. Existing deep models rely on a single sentence representation or multiple granularity representations for matching. However, such methods cannot well capture the contextualized local information in the matching process. To tackle this problem, we present a new deep architecture to match two sentences with multiple positional sentence representations. Specifically, each positional sentence representation is a sentence representation at this position, generated by a bidirectional long short term memory (Bi-LSTM). The matching score is finally produced by aggregating interactions between these different positional sentence representations, through -Max pooling and a multi-layer perceptron. Our model has several advantages: (1) By using Bi-LSTM, rich context of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
