Granularity of algorithmically constructed publication-level   classifications of research publications: Identification of specialties

Peter Sj\"og{\aa}rde; Per Ahlgren

arXiv:1901.05273·cs.DL·September 2, 2019

Granularity of algorithmically constructed publication-level classifications of research publications: Identification of specialties

Peter Sj\"og{\aa}rde, Per Ahlgren

PDF

TL;DR

This study proposes a methodology to determine the optimal granularity of publication classifications into research specialties by comparing clustering results with journal-based baselines, demonstrating its effectiveness through case studies.

Contribution

It introduces a novel approach to identify the appropriate level of specialty granularity in publication classifications using similarity measures and resolution parameters.

Findings

01

The methodology effectively distinguishes research specialties.

02

Class size variation in optimal classifications is moderate.

03

Small classes constitute a minor proportion of articles.

Abstract

In this work, in which we build on, and use the outcome of, an earlier study on topic identification in an algorithmically constructed publication-level classification (ACPLC), we address the issue how to algorithmically obtain a classification of topics (containing articles), where the classes of the classification correspond to specialties. The methodology we propose, which is similar to the one used in the earlier study, uses journals and their articles to construct a baseline classification. The underlying assumption of our approach is that journals of a particular size and foci have a scope that correspond to specialties. By measuring the similarity between (1) the baseline classification and (2) multiple classifications obtained by topic clustering and using different values of a resolution parameter, we have identified a best-performing ACPLC. In two case studies, we could…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.