Leveraging Large Language Models for Analyzing Blood Pressure Variations Across Biological Sex from Scientific Literature
Yuting Guo, Seyedeh Somayyeh Mousavi, Reza Sameni, Abeed Sarker

TL;DR
This study demonstrates the use of a large language model to extract blood pressure data from scientific literature, revealing variations across biological sex and showcasing the potential of LLMs in demographic health research.
Contribution
We developed an automated method using GPT-35-turbo to extract blood pressure statistics from millions of abstracts, enabling large-scale analysis of demographic variations.
Findings
LLMs can effectively extract blood pressure data from literature.
Significant BP differences observed between sexes.
Method proves scalable for demographic health studies.
Abstract
Hypertension, defined as blood pressure (BP) that is above normal, holds paramount significance in the realm of public health, as it serves as a critical precursor to various cardiovascular diseases (CVDs) and significantly contributes to elevated mortality rates worldwide. However, many existing BP measurement technologies and standards might be biased because they do not consider clinical outcomes, comorbidities, or demographic factors, making them inconclusive for diagnostic purposes. There is limited data-driven research focused on studying the variance in BP measurements across these variables. In this work, we employed GPT-35-turbo, a large language model (LLM), to automatically extract the mean and standard deviation values of BP for both males and females from a dataset comprising 25 million abstracts sourced from PubMed. 993 article abstracts met our predefined inclusion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfluenza Virus Research Studies
