Construction of a Battery Research Knowledge Graph using a Global Open Catalog
Luca Foppiano, Sae Dieb, Malik Zain, Kazuki Kasama, Keitaro Sodeyama, Mikiko Tanifuji

TL;DR
This paper presents a pipeline to construct a comprehensive, author-centric knowledge graph for battery research using open bibliographic data, semantic keyphrase extraction, and linked data integration.
Contribution
It introduces a novel method combining semantic keyphrases and bibliographic data to build an interoperable, cross-institutional battery research knowledge graph.
Findings
Supports author similarity and community detection.
Enables exploratory search via a web interface.
Links to external data sources like Wikidata.
Abstract
Battery research is a rapidly growing and highly interdisciplinary field, making it increasingly difficult to track relevant expertise and identify potential collaborators across institutional boundaries. In this work, we present a pipeline for constructing an author-centric knowledge graph of battery research built on OpenAlex, a large-scale open bibliographic catalogue. For each author, we derive a weighted research descriptors vector that combines coarse-grained OpenAlex concepts with fine-grained keyphrases extracted from titles and abstracts using KeyBERT with ChatGPT (gpt-3.5-turbo) as the backend model, selected after evaluating multiple alternatives. Vector components are weighted by research descriptor origin, authorship position, and temporal recency. The framework is applied to a corpus of 189,581 battery-related works. The resulting vectors support author-author similarity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
