Generating Knowledge Graphs by Employing Natural Language Processing and   Machine Learning Techniques within the Scholarly Domain

Danilo Dess\`i; Francesco Osborne; Diego Reforgiato Recupero; Davide; Buscaldi; Enrico Motta

arXiv:2011.01103·cs.CL·November 6, 2020

Generating Knowledge Graphs by Employing Natural Language Processing and Machine Learning Techniques within the Scholarly Domain

Danilo Dess\`i, Francesco Osborne, Diego Reforgiato Recupero, Davide, Buscaldi, Enrico Motta

PDF

1 Repo

TL;DR

This paper presents a novel architecture combining NLP and Machine Learning to extract and integrate entities and relationships from scientific publications into large-scale knowledge graphs, enhancing analysis and management of scholarly data.

Contribution

It introduces a hybrid system leveraging state-of-the-art NLP tools for knowledge extraction and demonstrates its application in building a comprehensive scientific knowledge graph.

Findings

01

Generated a knowledge graph with 109,105 triples from 26,827 abstracts.

02

Showed the hybrid system outperforms alternative approaches.

03

Applicable to any scientific domain for improved knowledge management.

Abstract

The continuous growth of scientific literature brings innovations and, at the same time, raises new challenges. One of them is related to the fact that its analysis has become difficult due to the high volume of published papers for which manual effort for annotations and management is required. Novel technological infrastructures are needed to help researchers, research policy makers, and companies to time-efficiently browse, analyse, and forecast scientific research. Knowledge graphs i.e., large networks of entities and relationships, have proved to be effective solution in this space. Scientific knowledge graphs focus on the scholarly domain and typically contain metadata describing research publications such as authors, venues, organizations, research topics, and citations. However, the current generation of knowledge graphs lacks of an explicit representation of the knowledge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

danilo-dessi/skg
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.