TrajOnco: a multi-agent framework for temporal reasoning over longitudinal EHR for multi-cancer early detection

Sihang Zeng; Young Won Kim; Wilson Lau; Ehsan Alipour; Ruth Etzioni; Meliha Yetisgen; Anand Oka

arXiv:2604.10386 · cs.AI · April 14, 2026

TL;DR

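TrajOnco is a training-free, multi-agent LLM framework that performs temporal reasoning over longitudinal EHR data for multi-cancer early detection. In zero-shot evaluation it reaches AUROCs of 0.64-0.80 across 15 cancer types and produces evidence-linked rationales alongside each risk score.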

Contribution

The paper introduces TrajOnco, a training-free chain-of-agents LLM architecture with long-term memory for scalable, interpretable multi-cancer risk prediction from longitudinal EHRs.

Findings

1. Achieved AUROCs of 0.64-0.80 in zero-shot risk prediction across 15 cancers.

2. Performed comparably to supervised models in lung cancer detection.

3. Enabled effective temporal reasoning with smaller models like GPT-4.1-mini.
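For readers unfamiliar with the metric in the first finding, AUROC is the probability that a randomly chosen case receives a higher risk score than a randomly chosen control, with ties counted as one half. The following is a minimal sketch of that definition in plain Python; the function name and the toy labels and scores are illustrative assumptions and are not taken from the paper or its data.

```python
from typing import Sequence


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as a pairwise rank statistic: P(case score > control score), ties count 0.5."""
    cases = [s for y, s in zip(labels, scores) if y == 1]
    controls = [s for y, s in zip(labels, scores) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for case_score in cases:
        for control_score in controls:
            if case_score > control_score:
                wins += 1.0
            elif case_score == control_score:
                wins += 0.5
    return wins / (len(cases) * len(controls))


# Toy example (made-up numbers): two cases and two controls.
toy_labels = [1, 1, 0, 0]
toy_scores = [0.9, 0.4, 0.5, 0.1]
toy_auroc = auroc(toy_labels, toy_scores)
```

Under this definition, a value of 0.5 corresponds to scores that carry no ranking information, and 1.0 to scores that rank every case above every control.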

Abstract

Accurate estimation of cancer risk from longitudinal electronic health records (EHRs) could support earlier detection and improved care, but modeling such complex patient trajectories remains challenging. We present TrajOnco, a training-free, multi-agent large language model (LLM) framework designed for scalable multi-cancer early detection. Using a chain-of-agents architecture with long-term memory, TrajOnco performs temporal reasoning over sequential clinical events to generate patient-level summaries, evidence-linked rationales, and predicted risk scores. We evaluated TrajOnco on de-identified Truveta EHR data across 15 cancer types using matched case-control cohorts, predicting risk of cancer diagnosis at 1 year. In zero-shot evaluation, TrajOnco achieved AUROCs of 0.64-0.80, performing comparably to supervised machine learning in a lung cancer benchmark while demonstrating better…
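The chain-of-agents idea with long-term memory can be pictured as follows: clinical events are ordered in time and split into chunks, each worker agent reads one chunk together with the memory left by its predecessors, and a final agent turns the accumulated memory into a risk score plus supporting evidence. The sketch below shows only this control flow under stated assumptions. Every name in it (`Memory`, `chunk_events`, `keyword_worker`, `simple_manager`, `run_chain`), the keyword rule, and the scoring formula are hypothetical stand-ins for LLM calls; none of them come from the paper.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# Hypothetical event type: (ISO date, free-text description of a clinical event).
Event = Tuple[str, str]


@dataclass
class Memory:
    """Long-term memory handed from one agent to the next."""
    notes: List[str] = field(default_factory=list)


def chunk_events(events: List[Event], size: int) -> List[List[Event]]:
    """Sort events chronologically, then split them into fixed-size chunks."""
    ordered = sorted(events, key=lambda e: e[0])
    return [ordered[i:i + size] for i in range(0, len(ordered), size)]


def keyword_worker(chunk: List[Event], memory: Memory) -> Memory:
    """Stand-in for an LLM worker: appends events that mention a flagged term."""
    flagged = ("weight loss", "hemoptysis", "anemia")  # illustrative terms only
    new_notes = [
        f"{date}: {text}"
        for date, text in chunk
        if any(term in text.lower() for term in flagged)
    ]
    return Memory(notes=memory.notes + new_notes)


def simple_manager(memory: Memory) -> Tuple[float, List[str]]:
    """Stand-in for the final agent: maps accumulated evidence to a score in [0, 1]."""
    score = min(1.0, 0.1 + 0.3 * len(memory.notes))  # arbitrary toy formula
    return score, memory.notes


def run_chain(
    events: List[Event],
    worker: Callable[[List[Event], Memory], Memory],
    manager: Callable[[Memory], Tuple[float, List[str]]],
    chunk_size: int = 2,
) -> Tuple[float, List[str]]:
    """Pass memory through the workers in temporal order, then score it."""
    memory = Memory()
    for chunk in chunk_events(events, chunk_size):
        memory = worker(chunk, memory)
    return manager(memory)
```

In a real system the two stand-in functions would be replaced by prompted model calls; the point of the sketch is that each agent sees a bounded chunk plus a compact memory rather than the full record, and that the final rationale can point back to dated events.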
