Understood in Translation, Transformers for Domain Understanding
Dimitrios Christofidellis, Matteo Manica, Leonidas Georgopoulos, Hans, Vandierendonck

TL;DR
This paper introduces a Transformer-based supervised machine learning method for automated domain definition from corpora, improving knowledge graph construction efficiency and quality, validated on multiple datasets including a new health domain dataset.
Contribution
The paper presents a novel Transformer-based approach for domain definition, enhancing automation and accuracy in knowledge graph generation from textual corpora.
Findings
Transformer model outperforms CNN and RNN baselines
Effective on multiple datasets including PubMed-based health domain
Lays foundation for fully automated KG generation
Abstract
Knowledge acquisition is the essential first step of any Knowledge Graph (KG) application. This knowledge can be extracted from a given corpus (KG generation process) or specified from an existing KG (KG specification process). Focusing on domain specific solutions, knowledge acquisition is a labor intensive task usually orchestrated and supervised by subject matter experts. Specifically, the domain of interest is usually manually defined and then the needed generation or extraction tools are utilized to produce the KG. Herein, we propose a supervised machine learning method, based on Transformers, for domain definition of a corpus. We argue why such automated definition of the domain's structure is beneficial both in terms of construction time and quality of the generated graph. The proposed method is extensively validated on three public datasets (WebNLG, NYT and DocRED) by comparing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Natural Language Processing Techniques
