Hierarchical Long-Term Semantic Memory for LinkedIn's Hiring Agent

Zhentao Xu; Shangjing Zhang; Emir Poyraz; Yvonne Li; Ye Jin; Xie Lu; Xiaoyang Gu; Karthik Ramgopal; Praveen Kumar Bodigutla; Xiaofeng Wang

arXiv:2604.26197·cs.IR·April 30, 2026

Hierarchical Long-Term Semantic Memory for LinkedIn's Hiring Agent

Zhentao Xu, Shangjing Zhang, Emir Poyraz, Yvonne Li, Ye Jin, Xie Lu, Xiaoyang Gu, Karthik Ramgopal, Praveen Kumar Bodigutla, Xiaofeng Wang

PDF

TL;DR

The paper presents HLTM, a hierarchical semantic memory system for LLM agents, enhancing long-term knowledge management with scalability, privacy, and low-latency retrieval, demonstrated in LinkedIn's Hiring Assistant.

Contribution

Introduces HLTM, a schema-aligned hierarchical memory framework that improves scalability, privacy, and retrieval efficiency for LLM agents in industrial applications.

Findings

01

HLTM improves answer correctness and retrieval F1 by over 10%.

02

HLTM advances the latency-performance Pareto frontier.

03

HLTM is deployed in LinkedIn's Hiring Assistant for core features.

Abstract

Large Language Model (LLM) agents are increasingly used in real-world products, where personalized and context-aware user interactions are essential. A central enabler of such capabilities is the agent's long-term semantic memory system, which extracts implicit and explicit signals from noisy longitudinal behavioral data, stores them in a structured form, and supports low-latency retrieval. Building industrial-grade long-term memory for LLM agents raises five challenges: scalability, low-latency retrieval, privacy constraints, cross-domain generalizability, and observability. We introduce the Hierarchical Long-Term Semantic Memory (HLTM) framework, which organizes textual data into a schema-aligned memory tree that captures semantic knowledge at multiple levels of granularity, enabling scalable ingestion, privacy-aware storage, low-latency retrieval, and transparent provenance; HLTM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.