LinkedIn Post Embeddings: Industrial Scale Embedding Generation and Usage across LinkedIn
Sudarshan Srinivasa Ramanujam, Akanksha Bindal, Yu Jiang, Timothy J. Hazen, David Golland, Fengyu Zhang, Daqi Sun, Wanning Li, Birjodh Singh Tiwana, Siddharth Dangi, Peng Yan

TL;DR
This paper details LinkedIn's development of large-scale, fine-tuned post embeddings using transformer models, which enhance retrieval and ranking tasks across multiple products, demonstrating superior performance and real-world impact.
Contribution
Introduction of LinkedIn's industrial-scale post embedding system utilizing multi-task fine-tuning of transformer models for improved semantic understanding and application in various products.
Findings
Embeddings outperform baseline models in zero-shot learning.
Embeddings surpass OpenAI's ADA-001 and ADA-002 on LinkedIn datasets.
Deployment within minutes of post creation enables real-time application.
Abstract
A post embedding (representation of text in embedding space that effectively captures semantic meaning) is a foundational component of LinkedIn that is consumed by product surfaces in retrieval and ranking (e.g., ranking posts in the feed or video tab). This paper presents the post embeddings used at LinkedIn, where a pre-trained transformer-based large language model (LLM) is taken as input and fine-tuned using multi-task learning across a diverse set of semantic labeling tasks. We observe positive transfer, leading to improved performance across all tasks, compared to training them independently. The generated post embeddings outperform baseline models in zero-shot learning, demonstrating its potential for broader applicability. Furthermore, the generated post embeddings' performance surpasses that of OpenAI's ADA-001 and ADA-002 embeddings on LinkedIn specific datasets and tasks. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Technology and Assessment
MethodsSparse Evolutionary Training · Contrastive Learning
