Hierarchical Long-Term Semantic Memory for LinkedIn's Hiring Agent
Zhentao Xu, Shangjing Zhang, Emir Poyraz, Yvonne Li, Ye Jin, Xie Lu, Xiaoyang Gu, Karthik Ramgopal, Praveen Kumar Bodigutla, Xiaofeng Wang

TL;DR
The paper presents HLTM, a hierarchical semantic memory system for LLM agents, enhancing long-term knowledge management with scalability, privacy, and low-latency retrieval, demonstrated in LinkedIn's Hiring Assistant.
Contribution
Introduces HLTM, a schema-aligned hierarchical memory framework that improves scalability, privacy, and retrieval efficiency for LLM agents in industrial applications.
Findings
HLTM improves answer correctness and retrieval F1 by over 10%.
HLTM advances the latency-performance Pareto frontier.
HLTM is deployed in LinkedIn's Hiring Assistant for core features.
Abstract
Large Language Model (LLM) agents are increasingly used in real-world products, where personalized and context-aware user interactions are essential. A central enabler of such capabilities is the agent's long-term semantic memory system, which extracts implicit and explicit signals from noisy longitudinal behavioral data, stores them in a structured form, and supports low-latency retrieval. Building industrial-grade long-term memory for LLM agents raises five challenges: scalability, low-latency retrieval, privacy constraints, cross-domain generalizability, and observability. We introduce the Hierarchical Long-Term Semantic Memory (HLTM) framework, which organizes textual data into a schema-aligned memory tree that captures semantic knowledge at multiple levels of granularity, enabling scalable ingestion, privacy-aware storage, low-latency retrieval, and transparent provenance; HLTM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
