Planner-Auditor Twin: Agentic Discharge Planning with FHIR-Based LLM Planning, Guideline Recall, Optional Caching and Self-Improvement
Kaiyuan Wu, Aditya Nagori, Rishikesan Kamaleswaran

TL;DR
This paper presents a self-improving, FHIR-based framework for clinical discharge planning using LLMs, which enhances safety, coverage, and calibration through deterministic auditing and targeted self-replay mechanisms.
Contribution
It introduces a novel Planner-Auditor architecture that decouples generation from validation, enabling systematic reliability improvements without retraining the LLM.
Findings
Task coverage increased from 32% to 86%.
Calibration metrics improved significantly, reducing high-confidence errors.
Discrepancy buffering enhanced omission correction during replay.
Abstract
Objective: Large language models (LLMs) show promise for clinical discharge planning, but their use is constrained by hallucination, omissions, and miscalibrated confidence. We introduce a self-improving, cache-optional Planner-Auditor framework that improves safety and reliability by decoupling generation from deterministic validation and targeted replay. Materials and Methods: We implemented an agentic, retrospective, FHIR-native evaluation pipeline using MIMIC-IV-on-FHIR. For each patient, the Planner (LLM) generates a structured discharge action plan with an explicit confidence estimate. The Auditor is a deterministic module that evaluates multi-task coverage, tracks calibration (Brier score, ECE proxies), and monitors action-distribution drift. The framework supports two-tier self-improvement: (i) within-episode regeneration when enabled, and (ii) cross-episode discrepancy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Artificial Intelligence in Healthcare and Education · Topic Modeling
