Use of NoSQL database and visualization techniques to analyze massive   scholarly article data from journals

Gouri Ginde; Snehanshu Saha; Archana Mathur; Harsha Vamsi; Sudeepa Roy; Dey; Swati Sampatrao Gambhire

arXiv:1805.00390·cs.DL·May 2, 2018

Use of NoSQL database and visualization techniques to analyze massive scholarly article data from journals

Gouri Ginde, Snehanshu Saha, Archana Mathur, Harsha Vamsi, Sudeepa Roy, Dey, Swati Sampatrao Gambhire

PDF

TL;DR

This paper demonstrates how NoSQL graph databases and visualization techniques can analyze large-scale scholarly article data from Google Scholar, revealing insights into authorship, domain shifts, and research trends.

Contribution

It introduces a method for scraping, storing, and visualizing massive scholarly data using NoSQL graph databases, enabling deeper analysis of research patterns and behaviors.

Findings

01

Visualization reveals domain shifts and emerging research areas.

02

Graph analysis detects cartel behavior among authors and journals.

03

Structured data storage facilitates complex bibliometric analysis.

Abstract

Visualization of the massive data is a challenging endeavor. Extracting data and providing graphical representations can aid in its effective utilization in terms of interpretation and knowledge discovery. Publishing research articles has become a way of life for academicians. The scholarly publications can shape-up the professional growth of authors and also expand the research and technological growth of a country, continent and other demographic regions. Scholarly articles have grown in gigantic numbers that are published in different domains by various journals. Information related to articles, authors, their affiliations, number of citations, country, publisher, references and other information is like a gold mine for statisticians and data analysts. This data when used skillfully, via visual analysis tool, can provide valuable understanding and can aid in deeper exposition for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.