TL;DR
This paper presents a novel architecture combining NLP and Machine Learning to extract and integrate entities and relationships from scientific publications into large-scale knowledge graphs, enhancing analysis and management of scholarly data.
Contribution
It introduces a hybrid system leveraging state-of-the-art NLP tools for knowledge extraction and demonstrates its application in building a comprehensive scientific knowledge graph.
Findings
Generated a knowledge graph with 109,105 triples from 26,827 abstracts.
Showed the hybrid system outperforms alternative approaches.
Applicable to any scientific domain for improved knowledge management.
Abstract
The continuous growth of scientific literature brings innovations and, at the same time, raises new challenges. One of them is related to the fact that its analysis has become difficult due to the high volume of published papers for which manual effort for annotations and management is required. Novel technological infrastructures are needed to help researchers, research policy makers, and companies to time-efficiently browse, analyse, and forecast scientific research. Knowledge graphs i.e., large networks of entities and relationships, have proved to be effective solution in this space. Scientific knowledge graphs focus on the scholarly domain and typically contain metadata describing research publications such as authors, venues, organizations, research topics, and citations. However, the current generation of knowledge graphs lacks of an explicit representation of the knowledge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
