Evaluating improvements on using Large Language Models (LLMs) for   property extraction in the Open Research Knowledge Graph (ORKG)

Sandra Schaftner

arXiv:2502.10768·cs.IR·February 18, 2025

Evaluating improvements on using Large Language Models (LLMs) for property extraction in the Open Research Knowledge Graph (ORKG)

Sandra Schaftner

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that advanced prompt engineering significantly improves the performance of Large Language Models in extracting and matching properties in scientific literature for the Open Research Knowledge Graph, enhancing data quality and consistency.

Contribution

The study introduces advanced prompt engineering techniques and property matching methods to improve LLM-based property extraction in ORKG, addressing previous performance limitations.

Findings

01

Advanced prompts significantly increase property matching accuracy.

02

Enhanced property consistency aligns with FAIR principles.

03

Results improve the applicability of ORKG for research comparisons.

Abstract

Current research highlights the great potential of Large Language Models (LLMs) for constructing Scholarly Knowledge Graphs (SKGs). One particularly complex step in this process is relation extraction, aimed at identifying suitable properties to describe the content of research. This study builds directly on previous research of three Open Research Knowledge Graph (ORKG) team members who assessed the readiness of LLMs such as GPT-3.5, Llama 2, and Mistral for property extraction in scientific literature. Given the moderate performance observed, the previous work concluded that fine-tuning is needed to improve these models' alignment with scientific tasks and their emulation of human expertise. Expanding on this prior experiment, this study evaluates the impact of advanced prompt engineering techniques and demonstrates that these techniques can highly significantly enhance the results.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SandraSchaftner/orkg_property_extraction_using_gpt-3.5
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling