Detecting Clinical Discrepancies in Health Coaching Agents: A Dual-Stream Memory and Reconciliation Architecture
Samuel L Pugh, Eric Yang, Alexander Muir Sutherland, Alessandra Breschi

TL;DR
This paper presents a dual-stream memory architecture with a reconciliation engine for health coaching agents, enabling accurate detection of discrepancies between patient reports and clinical records, crucial for safe longitudinal healthcare management.
Contribution
The introduction of a dual-stream memory system with a dedicated reconciliation engine to identify discrepancies between patient narratives and structured clinical data in health agents.
Findings
Engine detects 84.4% of clinical discrepancies.
Achieves 86.7% safety-critical recall.
Identifies 13.6% error cascade due to memory extraction issues.
Abstract
As Large Language Model (LLM) agents transition from single-session tools to persistent systems managing longitudinal healthcare journeys, their memory architectures face a critical challenge: reconciling two imperfect sources of truth. The patient's evolving self-report is current but prone to recall bias, while the Electronic Health Record (EHR) is medically validated but frequently stale. General-purpose agent memory systems optimize for coherence by overwriting older facts with the user's latest statement, a pattern that risks safety failures when applied to clinical data. We introduce a Dual-Stream Memory Architecture that strictly separates the patient narrative from the structured clinical record (FHIR), governed by a dedicated Reconciliation Engine that evaluates every extracted memory against the patient's FHIR profile and classifies discrepancies by type, severity, and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
