Integrating Expert Knowledge into Logical Programs via LLMs
Franciszek G\'orski, Oskar Wysocki, Marco Valentino, Andre Freitas

TL;DR
This paper presents ExKLoP, a benchmarking framework for evaluating how well Large Language Models incorporate expert knowledge into logical reasoning tasks relevant to engineering safety and reliability.
Contribution
It introduces a systematic evaluation method and dataset for assessing LLMs' ability to generate correct logical rules from expert knowledge, including self-correction capabilities.
Findings
Most models produce syntactically correct code.
Performance in implementing logical rules varies across models.
Models show potential for self-improvement through iterative feedback.
Abstract
This paper introduces ExKLoP, a novel framework designed to evaluate how effectively Large Language Models (LLMs) integrate expert knowledge into logical reasoning systems. This capability is especially valuable in engineering, where expert knowledge-such as manufacturer-recommended operational ranges-can be directly embedded into automated monitoring systems. By mirroring expert verification steps, tasks like range checking and constraint validation help ensure system safety and reliability. Our approach systematically evaluates LLM-generated logical rules, assessing both syntactic fluency and logical correctness in these critical validation tasks. We also explore the models' capacity for self-correction via an iterative feedback loop based on code execution outcomes. ExKLoP presents an extensible dataset comprising 130 engineering premises, 950 prompts, and corresponding validation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · AI-based Problem Solving and Planning · Data Mining Algorithms and Applications
MethodsSparse Evolutionary Training
