# Go Wide, Go Deep: Quantifying the Impact of Scientific Papers through   Influence Dispersion Trees

**Authors:** Dattatreya Mohapatra, Abhishek Maiti, Sumit Bhatia, Tanmoy, Chakraborty

arXiv: 1904.07579 · 2019-04-17

## TL;DR

This paper introduces Influence Dispersion Trees to analyze how scientific papers influence research fields, proposing new metrics like NID that outperform citation counts in predicting future impact and identifying influential papers.

## Contribution

It presents a novel data structure and metrics for quantifying a paper's influence by analyzing citation organization, enhancing impact assessment beyond simple citation counts.

## Key findings

- NID outperforms raw citation count in early impact prediction.
- NID better identifies highly influential papers recognized over time.
- Ideal IDT configuration has equal depth and breadth, minimizing NID.

## Abstract

Despite a long history of use of citation count as a measure to assess the impact or influence of a scientific paper, the evolution of follow-up work inspired by the paper and their interactions through citation links have rarely been explored to quantify how the paper enriches the depth and breadth of a research field. We propose a novel data structure, called Influence Dispersion Tree (IDT) to model the organization of follow-up papers and their dependencies through citations. We also propose the notion of an ideal IDT for every paper and show that an ideal (highly influential) paper should increase the knowledge of a field vertically and horizontally. Upon suitably exploring the structural properties of IDT, we derive a suite of metrics, namely Influence Dispersion Index (IDI), Normalized Influence Divergence (NID) to quantify the influence of a paper. Our theoretical analysis shows that an ideal IDT configuration should have equal depth and breadth (and thus minimize the NID value). We establish the superiority of NID as a better influence measure in two experimental settings. First, on a large real-world bibliographic dataset, we show that NID outperforms raw citation count as an early predictor of the number of new citations a paper will receive within a certain period after publication. Second, we show that NID is superior to the raw citation count at identifying the papers recognized as highly influential through Test of Time Award among all their contemporary papers (published in the same venue). We conclude that in order to quantify the influence of a paper, along with the total citation count, one should also consider how the citing papers are organized among themselves to better understand the influence of a paper on the research field. For reproducibility, the code and datasets used in this study are being made available to the community.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.07579/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/1904.07579/full.md

## References

43 references — full list in the complete paper: https://tomesphere.com/paper/1904.07579/full.md

---
Source: https://tomesphere.com/paper/1904.07579