C-Rank: A Link-based Similarity Measure for Scientific Literature Databases
Seok-Ho Yoon, Sang-Wook Kim, and Sunju Park

TL;DR
C-Rank is a new link-based similarity measure for scientific papers that considers both in-links and out-links, effectively handling the unique citation patterns in scientific literature databases to improve retrieval accuracy.
Contribution
The paper introduces C-Rank, a novel similarity measure that disregards citation direction and addresses limitations of existing measures in scientific literature databases.
Findings
C-Rank outperforms existing similarity measures in accuracy.
It effectively handles old and recent papers with different citation patterns.
The proposed normalization method improves similarity assessment.
Abstract
As the number of people who use scientific literature databases grows, the demand for literature retrieval services has been steadily increased. One of the most popular retrieval services is to find a set of papers similar to the paper under consideration, which requires a measure that computes similarities between papers. Scientific literature databases exhibit two interesting characteristics that are different from general databases. First, the papers cited by old papers are often not included in the database due to technical and economic reasons. Second, since a paper references the papers published before it, few papers cite recently-published papers. These two characteristics cause all existing similarity measures to fail in at least one of the following cases: (1) measuring the similarity between old, but similar papers, (2) measuring the similarity between recent, but similar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Data Mining and Analysis · Text and Document Classification Technologies · Advanced Clustering Algorithms Research
