Kastor: Fine-tuned Small Language Models for Shape-based Active Relation Extraction
Ringwald Celian, Gandon Fabien, Faron Catherine, Michel Franck, Abi Akl Hanna

TL;DR
Kastor is a framework that fine-tunes small language models for shape-based relation extraction, improving knowledge base completion and refinement in specialized domains through an innovative validation and learning process.
Contribution
Kastor introduces a novel approach that reformulates validation and employs iterative learning to enhance small language models for relation extraction from RDF data.
Findings
Improved model generalization and performance in relation extraction.
Effective refinement of noisy knowledge bases.
Capability to uncover new relevant facts.
Abstract
RDF pattern-based extraction is a compelling approach for fine-tuning small language models (SLMs) by focusing a relation extraction task on a specified SHACL shape. This technique enables the development of efficient models trained on limited text and RDF data. In this article, we introduce Kastor, a framework that advances this approach to meet the demands for completing and refining knowledge bases in specialized domains. Kastor reformulates the traditional validation task, shifting from single SHACL shape validation to evaluating all possible combinations of properties derived from the shape. By selecting the optimal combination for each training example, the framework significantly enhances model generalization and performance. Additionally, Kastor employs an iterative learning process to refine noisy knowledge bases, enabling the creation of robust models capable of uncovering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining
