Clinical Trials Ontology Engineering with Large Language Models

Berkan \c{C}ak{\i}r

arXiv:2412.14387·cs.AI·December 20, 2024

Clinical Trials Ontology Engineering with Large Language Models

Berkan \c{C}ak{\i}r

PDF

Open Access

TL;DR

This paper demonstrates that large language models like GPT-3.5, GPT-4, and Llama3 can effectively automate the extraction and integration of clinical trial data, reducing time and costs while maintaining quality.

Contribution

It introduces a methodology leveraging LLMs for clinical trial ontology engineering, showing their viability compared to human efforts.

Findings

01

LLMs significantly reduce time and cost in ontology creation.

02

LLMs produce comparable quality to human-created ontologies.

03

Real-time data integration in medical research becomes feasible.

Abstract

Managing clinical trial information is currently a significant challenge for the medical industry, as traditional methods are both time-consuming and costly. This paper proposes a simple yet effective methodology to extract and integrate clinical trial data in a cost-effective and time-efficient manner. Allowing the medical industry to stay up-to-date with medical developments. Comparing time, cost, and quality of the ontologies created by humans, GPT3.5, GPT4, and Llama3 (8b & 70b). Findings suggest that large language models (LLM) are a viable option to automate this process both from a cost and time perspective. This study underscores significant implications for medical research where real-time data integration from clinical trials could become the norm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Semantic Web and Ontologies