OncoReason: Structuring Clinical Reasoning in LLMs for Robust and Interpretable Survival Prediction

Raghu Vamshi Hemadri; Geetha Krishna Guruju; Kristi Topollai; Anna Ewa Choromanska

arXiv:2510.17532·cs.CL·October 21, 2025

OncoReason: Structuring Clinical Reasoning in LLMs for Robust and Interpretable Survival Prediction

Raghu Vamshi Hemadri, Geetha Krishna Guruju, Kristi Topollai, Anna Ewa Choromanska

PDF

Open Access 1 Datasets

TL;DR

This paper introduces OncoReason, a multi-task framework that enhances large language models with structured clinical reasoning for more accurate and interpretable cancer survival predictions, using novel alignment strategies.

Contribution

It proposes a unified multi-task learning approach with alignment strategies like CoT prompting and reinforcement learning to improve interpretability and accuracy in clinical outcome prediction.

Findings

01

CoT prompting improves F1 by +6.0 and reduces MAE by 12%.

02

GRPO achieves state-of-the-art interpretability and predictive performance.

03

Biomedical LLMs often fail to produce valid reasoning traces.

Abstract

Predicting cancer treatment outcomes requires models that are both accurate and interpretable, particularly in the presence of heterogeneous clinical data. While large language models (LLMs) have shown strong performance in biomedical NLP, they often lack structured reasoning capabilities critical for high-stakes decision support. We present a unified, multi-task learning framework that aligns autoregressive LLMs with clinical reasoning for outcome prediction on the MSK-CHORD dataset. Our models are trained to jointly perform binary survival classification, continuous survival time regression, and natural language rationale generation. We evaluate three alignment strategies: (1) standard supervised fine-tuning (SFT), (2) SFT with Chain-of-Thought (CoT) prompting to elicit step-by-step reasoning, and (3) Group Relative Policy Optimization (GRPO), a reinforcement learning method that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

oncollm/cancer-reasoning-traces
dataset· 15 dl
15 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Artificial Intelligence in Healthcare and Education · Topic Modeling