Narrative Feature or Structured Feature? A Study of Large Language   Models to Identify Cancer Patients at Risk of Heart Failure

Ziyi Chen; Mengyuan Zhang; Mustafa Mohammed Ahmed; Yi Guo; Thomas J.; George; Jiang Bian; Yonghui Wu

arXiv:2403.11425·cs.LG·November 5, 2024·3 cites

Narrative Feature or Structured Feature? A Study of Large Language Models to Identify Cancer Patients at Risk of Heart Failure

Ziyi Chen, Mengyuan Zhang, Mustafa Mohammed Ahmed, Yi Guo, Thomas J., George, Jiang Bian, Yonghui Wu

PDF

Open Access 1 Repo

TL;DR

This study evaluates machine learning models, including large language models, to predict heart failure risk in cancer patients using electronic health records, demonstrating that narrative features significantly enhance prediction accuracy.

Contribution

It introduces novel narrative features derived from structured medical codes and compares their effectiveness across multiple ML models, highlighting the superior performance of GatorTron-3.9B.

Findings

01

GatorTron-3.9B achieved the highest F1 scores among models.

02

Narrative features increased feature density and improved model performance.

03

LLMs outperformed traditional ML and other transformer models in predicting HF risk.

Abstract

Cancer treatments are known to introduce cardiotoxicity, negatively impacting outcomes and survivorship. Identifying cancer patients at risk of heart failure (HF) is critical to improving cancer treatment outcomes and safety. This study examined machine learning (ML) models to identify cancer patients at risk of HF using electronic health records (EHRs), including traditional ML, Time-Aware long short-term memory (T-LSTM), and large language models (LLMs) using novel narrative features derived from the structured medical codes. We identified a cancer cohort of 12,806 patients from the University of Florida Health, diagnosed with lung, breast, and colorectal cancers, among which 1,602 individuals developed HF after cancer. The LLM, GatorTron-3.9B, achieved the best F1 scores, outperforming the traditional support vector machines by 39%, the T-LSTM deep learning model by 7%, and a widely…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uf-hobi-informatics-lab/gatortron-prediction
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Layer Normalization · Multi-Head Attention · Linear Warmup With Linear Decay · Dropout · Dense Connections · Adam · Attention Dropout