Understanding Main Path Analysis
H.C.W. Price, T.S. Evans

TL;DR
This paper provides a new theoretical foundation for main path analysis in citation networks, introduces basket-based methods for better core knowledge identification, and demonstrates their effectiveness over traditional approaches.
Contribution
It establishes an information-theoretic basis for main path analysis, proposes basket-based algorithms for more comprehensive core node detection, and shows their practical advantages in real networks.
Findings
Entropy-based variants optimize geometric distance measures.
Longest path approach yields similar results with simpler implementation.
Basket-based methods effectively identify core knowledge structures.
Abstract
Main path analysis has long been used to trace knowledge trajectories in citation networks, yet it lacks solid theoretical foundations. To understand when and why this approach succeeds, we analyse directed acyclic graphs created from two types of artificial models and by looking at over twenty networks derived from real data. We show that entropy-based variants of main path analysis optimise geometric distance measures, providing its first information-theoretic and geometric basis. Numerical results demonstrate that existing algorithms converge on near-geodesic solutions. We also show that an approach based on longest paths produces similar results, is equally well motivated yet is much simpler to implement. However, the traditional single-path focus is unnecessarily restrictive, as many near-optimal paths highlight different key nodes. We introduce an approach using ``baskets'' of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Complex Network Analysis Techniques · Graph Theory and Algorithms
