Attention Is Not Retention: The Orthogonality Constraint in Infinite-Context Architectures
Oliver Zahn, Matt Beton, Simran Chana

TL;DR
This paper identifies the semantic interference problem in neural memory systems caused by shared continuous representations, and proposes Knowledge Objects as a structured solution to enable reliable episodic memory in AI.
Contribution
It introduces the Orthogonality Constraint issue in neural memory, demonstrates its impact across modalities, and proposes Knowledge Objects as a novel structured memory approach.
Findings
Semantic interference occurs at N=5 facts with high similarity
Knowledge Objects significantly improve memory retrieval accuracy
Unstructured text storage leads to schema drift and ambiguity
Abstract
Biological memory solves a problem that eludes current AI: storing specific episodic facts without corrupting general semantic knowledge. Complementary Learning Systems theory explains this through two subsystems - a fast hippocampal system using sparse, pattern-separated representations for episodes, and a slow neocortical system using distributed representations for statistical regularities. Current AI systems lack this separation, attempting to serve both functions through neural weights alone. We identify the Orthogonality Constraint: reliable memory requires orthogonal keys, but semantic embeddings cannot be orthogonal because training clusters similar concepts together. The result is Semantic Interference (connecting to what cognitive psychologists have long observed in human memory), where neural systems writing facts into shared continuous parameters collapse to near-random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFerroelectric and Negative Capacitance Devices · Cognitive Computing and Networks · Machine Learning and Algorithms
