Data Overdose? Time for a Quadruple Shot: Knowledge Graph Construction using Enhanced Triple Extraction
Taine J. Elliott, Stephen P. Levitt, Ken Nixon, Martin Bekker

TL;DR
This paper introduces a novel method for constructing biomedical knowledge graphs from PubMed abstracts using enhanced triple extraction with context variables, improving accuracy and enabling inference of new relationships.
Contribution
It presents a pipeline utilizing large language models and ontology-based methods to generate quadruples for knowledge graphs, incorporating context for improved semantic clarity.
Findings
Achieved an average cosine similarity of 0.874 for generated sentences from triples.
Enhanced triples with context variables showed increased similarity over ordinary triples.
Demonstrated the potential for LLMs to infer new relationships and connect knowledge clusters.
Abstract
The rapid expansion of publicly-available medical data presents a challenge for clinicians and researchers alike, increasing the gap between the volume of scientific literature and its applications. The steady growth of studies and findings overwhelms medical professionals at large, hindering their ability to systematically review and understand the latest knowledge. This paper presents an approach to information extraction and automatic knowledge graph (KG) generation to identify and connect biomedical knowledge. Through a pipeline of large language model (LLM) agents, the system decomposes 44 PubMed abstracts into semantically meaningful proposition sentences and extracts KG triples from these sentences. The triples are enhanced using a combination of open domain and ontology-based information extraction methodologies to incorporate ontological categories. On top of this, a context…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
