Cognis: Context-Aware Memory for Conversational AI Agents

Parshva Daftari; Khush Patel; Shreyas Kapale; Jithin George; Siva Surendira

arXiv:2604.19771·cs.CL·April 23, 2026

Cognis: Context-Aware Memory for Conversational AI Agents

Parshva Daftari, Khush Patel, Shreyas Kapale, Jithin George, Siva Surendira

PDF

TL;DR

Cognis introduces a memory architecture for conversational AI that combines keyword and vector search, enabling persistent, context-aware memory and improved personalization in dialogue systems.

Contribution

A novel multi-stage retrieval pipeline integrating keyword and vector search with temporal and reranking enhancements for persistent conversational memory.

Findings

01

Achieves state-of-the-art results on LoCoMo and LongMemEval benchmarks.

02

Demonstrates effective memory retrieval across multiple answer generation models.

03

System is open-source and deployed in real-world applications.

Abstract

LLM agents lack persistent memory, causing conversations to reset each session and preventing personalization over time. We present Lyzr Cognis, a unified memory architecture for conversational AI agents that addresses this limitation through a multi-stage retrieval pipeline. Cognis combines a dual-store backend pairing OpenSearch BM25 keyword matching with Matryoshka vector similarity search, fused via Reciprocal Rank Fusion. Its context-aware ingestion pipeline retrieves existing memories before extraction, enabling intelligent version tracking that preserves full memory history while keeping the store consistent. Temporal boosting enhances time-sensitive queries, and a BGE-2 cross-encoder reranker refines final result quality. We evaluate Cognis on two independent benchmarks -- LoCoMo and LongMemEval -- across eight answer generation models, demonstrating state-of-the-art performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.