Generating large-scale network analyses of scientific landscapes in seconds using Dimensions on Google BigQuery
Michele Pasin, Richard Abdill

TL;DR
This paper introduces an open-source Python library that simplifies accessing and visualizing large-scale bibliometric data from Dimensions on Google BigQuery, enabling rapid analysis of scientific landscapes.
Contribution
The authors present a new Python tool that streamlines data retrieval and visualization from Dimensions, making complex bibliometric analyses more accessible and efficient.
Findings
Enables analysis of COVID-19 research patterns
Reduces technical barriers for data visualization
Facilitates large-scale scholarly landscape studies
Abstract
The growth of large, programatically accessible bibliometrics databases presents new opportunities for complex analyses of publication metadata. In addition to providing a wealth of information about authors and institutions, databases such as those provided by Dimensions also provide conceptual information and links to entities such as grants, funders and patents. However, data is not the only challenge in evaluating patterns in scholarly work: These large datasets can be challenging to integrate, particularly for those unfamiliar with the complex schemas necessary for accommodating such heterogeneous information, and those most comfortable with data mining may not be as experienced in data visualisation. Here, we present an open-source Python library that streamlines the process accessing and diagramming subsets of the Dimensions on Google BigQuery database and demonstrate its use on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Data Visualization and Analytics · Bioinformatics and Genomic Networks
