APEX-EM: Non-Parametric Online Learning for Autonomous Agents via Structured Procedural-Episodic Experience Replay
Pratyay Banerjee, Masud Moshtaghi, Ankit Chadha

TL;DR
APEX-EM is a non-parametric online learning framework that enables autonomous agents to reuse structured procedural experiences, improving task performance without altering model weights.
Contribution
It introduces a novel structured experience representation and retrieval workflow that allows cross-domain transfer and enhances autonomous agent capabilities.
Findings
Achieves 89.6% accuracy on KGQAGen-10k, surpassing baseline and oracle retrieval.
Reaches 83.3% success rate on BigCodeBench, significantly outperforming previous methods.
Entity graph retrieval improves to 48.0%, demonstrating effective cross-domain transfer.
Abstract
LLM-based autonomous agents lack persistent procedural memory: they re-derive solutions from scratch even when structurally identical tasks have been solved before. We present APEX-EM, a non-parametric online learning framework that accumulates, retrieves, and reuses structured procedural plans without modifying model weights. APEX-EM introduces: (1) a structured experience representation encoding the full procedural-episodic trace of each execution -- planning steps, artifacts, iteration history with error analysis, and quality scores; (2) a Plan-Retrieve-Generate-Iterate-Ingest (PRGII) workflow with Task Verifiers providing multi-dimensional reward signals; and (3) a dual-outcome Experience Memory with hybrid retrieval combining semantic search, structural signature matching, and plan DAG traversal -- enabling cross-domain transfer between tasks sharing no lexical overlap but…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
