Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval
Hamid Palangi, Li Deng, Yelong Shen, Jianfeng Gao, Xiaodong He,, Jianshu Chen, Xinying Song, Rabab Ward

TL;DR
This paper introduces an LSTM-based neural network model for sentence embedding that captures semantic meaning, automatically detects keywords, and significantly improves web document retrieval performance over existing methods.
Contribution
The paper presents a novel LSTM-RNN approach for sentence embedding trained on click data, demonstrating automatic keyword detection and superior retrieval accuracy.
Findings
LSTM-RNN effectively captures sentence semantics.
The model automatically detects salient keywords.
It outperforms existing embedding methods in web retrieval tasks.
Abstract
This paper develops a model that addresses sentence embedding, a hot topic in current natural language processing research, using recurrent neural networks with Long Short-Term Memory (LSTM) cells. Due to its ability to capture long term memory, the LSTM-RNN accumulates increasingly richer information as it goes through the sentence, and when it reaches the last word, the hidden layer of the network provides a semantic representation of the whole sentence. In this paper, the LSTM-RNN is trained in a weakly supervised manner on user click-through data logged by a commercial web search engine. Visualization and analysis are performed to understand how the embedding process works. The model is found to automatically attenuate the unimportant words and detects the salient keywords in the sentence. Furthermore, these detected keywords are found to automatically activate different cells of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
