Large Language Model for OWL Proofs

Hui Yang; Jiaoyan Chen; Uli Sattler

arXiv:2601.12444·cs.AI·January 21, 2026

Large Language Model for OWL Proofs

Hui Yang, Jiaoyan Chen, Uli Sattler

PDF

Open Access

TL;DR

This paper investigates the ability of Large Language Models to generate faithful, human-readable proofs for OWL ontologies, highlighting their strengths and limitations in complex reasoning scenarios.

Contribution

The study develops an automated dataset and evaluation framework for proof generation in OWL reasoning, providing new insights into LLM capabilities and challenges.

Findings

01

Some models perform well on simple cases but struggle with complexity.

02

Logical complexity impacts performance more than representation format.

03

Input noise and incompleteness significantly reduce LLM effectiveness.

Abstract

The ability of Large Language Models (LLMs) to perform reasoning tasks such as deduction has been widely investigated in recent years. Yet, their capacity to generate proofs-faithful, human-readable explanations of why conclusions follow-remains largely under explored. In this work, we study proof generation in the context of OWL ontologies, which are widely adopted for representing and reasoning over complex knowledge, by developing an automated dataset construction and evaluation framework. Our evaluation encompassing three sequential tasks for complete proving: Extraction, Simplification, and Explanation, as well as an additional task of assessing Logic Completeness of the premise. Through extensive experiments on widely used reasoning LLMs, we achieve important findings including: (1) Some models achieve overall strong results but remain limited on complex cases; (2) Logical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Multimodal Machine Learning Applications