MatKG: An autonomously generated knowledge graph in Material Science
Vineeth Venugopal, Elsa Olivetti

TL;DR
MatKG is a large knowledge graph in materials science that organizes data from scientific literature to aid material discovery and analysis.
Contribution
MatKG is the largest autonomously generated knowledge graph in materials science, containing over 70,000 entities and 5.4 million triples.
Findings
MatKG includes entities like materials, properties, and synthesis methods extracted via natural language processing.
The graph is available in CSV and RDF formats, with code and data shared publicly for research use.
MatKG supports applications such as material discovery and recommendation systems.
Abstract
In this paper, we present MatKG, a knowledge graph in materials science that offers a repository of entities and relationships extracted from scientific literature. Using advanced natural language processing techniques, MatKG includes an array of entities, including materials, properties, applications, characterization and synthesis methods, descriptors, and symmetry phase labels. The graph is formulated based on statistical metrics, encompassing over 70,000 entities and 5.4 million unique triples. To enhance accessibility and utility, we have serialized MatKG in both CSV and RDF formats and made these, along with the code base, available to the research community. As the largest knowledge graph in materials science to date, MatKG provides structured organization of domain-specific data. Its deployment holds promise for various applications, including material discovery, recommendation…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk Management in Financial Firms · Corporate Social Responsibility Reporting · Auditing, Earnings Management, Governance
