Deep Natural Language Processing for LinkedIn Search
Weiwei Guo, Xiaowei Liu, Sida Wang, Michaeel Kazi, Zhiwei Wang,, Zhoutong Fu, Jun Jia, Liang Zhang, Huiji Gao, Bo Long

TL;DR
This paper explores the application of deep NLP techniques, including BERT pre-training, to improve various search system tasks at LinkedIn, addressing challenges like latency and robustness.
Contribution
It provides a comprehensive study of deep NLP methods applied to five key search tasks and introduces BERT pre-training as a sixth versatile task.
Findings
Deep NLP improves query intent prediction and document ranking.
BERT pre-training enhances multiple search tasks.
Insights on latency and robustness challenges in large-scale deployment.
Abstract
Many search systems work with large amounts of natural language data, e.g., search queries, user profiles, and documents. Building a successful search system requires a thorough understanding of textual data semantics, where deep learning based natural language processing techniques (deep NLP) can be of great help. In this paper, we introduce a comprehensive study for applying deep NLP techniques to five representative tasks in search systems: query intent prediction (classification), query tagging (sequential tagging), document ranking (ranking), query auto completion (language modeling), and query suggestion (sequence to sequence). We also introduce BERT pre-training as a sixth task that can be applied to many of the other tasks. Through the model design and experiments of the six tasks, readers can find answers to four important questions: (1). When is deep NLP helpful/not helpful in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Information Retrieval and Search Behavior
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Weight Decay · Residual Connection · Adam · Dropout · Softmax · WordPiece · Layer Normalization
