Granularity of algorithmically constructed publication-level classifications of research publications: Identification of specialties
Peter Sj\"og{\aa}rde, Per Ahlgren

TL;DR
This study proposes a methodology to determine the optimal granularity of publication classifications into research specialties by comparing clustering results with journal-based baselines, demonstrating its effectiveness through case studies.
Contribution
It introduces a novel approach to identify the appropriate level of specialty granularity in publication classifications using similarity measures and resolution parameters.
Findings
The methodology effectively distinguishes research specialties.
Class size variation in optimal classifications is moderate.
Small classes constitute a minor proportion of articles.
Abstract
In this work, in which we build on, and use the outcome of, an earlier study on topic identification in an algorithmically constructed publication-level classification (ACPLC), we address the issue how to algorithmically obtain a classification of topics (containing articles), where the classes of the classification correspond to specialties. The methodology we propose, which is similar to the one used in the earlier study, uses journals and their articles to construct a baseline classification. The underlying assumption of our approach is that journals of a particular size and foci have a scope that correspond to specialties. By measuring the similarity between (1) the baseline classification and (2) multiple classifications obtained by topic clustering and using different values of a resolution parameter, we have identified a best-performing ACPLC. In two case studies, we could…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
