Iterative Learning of Computable Phenotypes for Treatment Resistant Hypertension using Large Language Models

Guilherme Seidyo Imai Aldeia; Daniel S. Herman; William G. La Cava

arXiv:2508.05581·cs.LG·August 8, 2025

TL;DR

This paper explores how large language models can generate and iteratively improve interpretable computable phenotypes for hypertension, achieving competitive accuracy with less data than traditional machine learning methods.

Contribution

The paper introduces an iterative strategy (synthesize, execute, debug, instruct) in which LLMs generate computable phenotypes and refine them using data-driven feedback, yielding interpretable programs that need fewer training examples.

Findings

01

LLMs can generate accurate computable phenotypes for hypertension.

02

Iterative learning improves phenotype quality and accuracy.

03

LLM-generated phenotypes approach state-of-the-art ML performance while requiring fewer training examples.

Abstract

Large language models (LLMs) have demonstrated remarkable capabilities for medical question answering and programming, but their potential for generating interpretable computable phenotypes (CPs) is under-explored. In this work, we investigate whether LLMs can generate accurate and concise CPs for six clinical phenotypes of varying complexity, which could be leveraged to enable scalable clinical decision support to improve care for patients with hypertension. In addition to evaluating zero-shot performance, we propose and test a synthesize, execute, debug, instruct strategy that uses LLMs to generate and iteratively refine CPs using data-driven feedback. Our results show that LLMs, coupled with iterative learning, can generate interpretable and reasonably accurate programs that approach the performance of state-of-the-art ML methods while requiring significantly fewer training examples.
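
The abstract names the loop but does not spell out its mechanics, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes a candidate CP is Python source defining a function `phenotype(patient)`, that "debug" means feeding runtime errors back to the model, and that "instruct" means feeding back misclassified examples. The names `execute`, `refine`, and `make_stub_llm`, the fields `sbp` and `n_meds`, and the toy labeling rule are all hypothetical; the stub stands in for a real LLM call so the example runs offline.

```python
from typing import Callable, Dict, List, Optional, Tuple

Patient = Dict[str, float]


def execute(src: str, patients: List[Patient]) -> Tuple[Optional[List[bool]], str]:
    """Run candidate source defining `phenotype(patient)`; return (predictions, error)."""
    namespace: dict = {}
    try:
        exec(src, namespace)  # acceptable for a toy; sandbox untrusted code in practice
        return [bool(namespace["phenotype"](p)) for p in patients], ""
    except Exception as exc:
        return None, f"{type(exc).__name__}: {exc}"


def refine(llm: Callable[[str], str], task: str, patients: List[Patient],
           labels: List[bool], max_iters: int = 5) -> Tuple[str, float]:
    """Iteratively ask `llm` for a program, run it, and feed the results back."""
    prompt, best_src, best_acc = task, "", -1.0
    for _ in range(max_iters):
        src = llm(prompt)                       # synthesize
        preds, error = execute(src, patients)   # execute
        if preds is None:                       # debug: report the runtime error
            acc, feedback = 0.0, f"The program crashed with {error}"
        else:                                   # instruct: report misclassified cases
            wrong = [p for p, y, yhat in zip(patients, labels, preds) if y != yhat]
            acc = 1.0 - len(wrong) / len(patients)
            feedback = f"Accuracy {acc:.2f}. Misclassified: {wrong}"
        if acc > best_acc:
            best_src, best_acc = src, acc
        if acc == 1.0:
            break
        prompt = f"{task}\n\nPrevious program:\n{src}\n\nFeedback: {feedback}"
    return best_src, best_acc


def make_stub_llm() -> Callable[[str], str]:
    """Hypothetical stand-in for an LLM: replays three canned programs in order."""
    canned = iter([
        "def phenotype(p):\n    return p['bp'] >= 140\n",   # crashes: wrong field name
        "def phenotype(p):\n    return p['sbp'] >= 140\n",  # runs, ignores medications
        "def phenotype(p):\n    return p['sbp'] >= 140 and p['n_meds'] >= 3\n",
    ])
    return lambda prompt: next(canned)


# Toy data: the labels follow an invented rule, not a clinical definition from the paper.
patients = [
    {"sbp": 150, "n_meds": 3},
    {"sbp": 150, "n_meds": 1},
    {"sbp": 120, "n_meds": 3},
    {"sbp": 120, "n_meds": 0},
]
labels = [True, False, False, False]

best_src, best_acc = refine(make_stub_llm(), "Write phenotype(p).", patients, labels)
print(best_acc)
print(best_src)
```

With the stub, the first candidate fails at runtime, the second misclassifies one patient, and the third matches every label, at which point the loop stops. Swapping `make_stub_llm()` for a function that calls an actual model would leave the loop unchanged; only the feedback wording and the stopping criterion would likely need tuning.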
