Quantifying the Cognitive Extent of Science
Sta\v{s}a Milojevi\'c

TL;DR
This paper introduces a novel big-data method to measure the cognitive boundaries of scientific fields using lexical diversity in article titles, revealing that publication volume does not equate to cognitive growth and that smaller teams contribute more to cognitive expansion.
Contribution
The paper presents a new quantitative approach to assess the cognitive extent of scientific literature independently from publication volume, applied to large datasets across multiple disciplines.
Findings
Cognitive growth periods do not align with publication volume trends.
Larger research teams cover smaller cognitive territories than smaller teams.
The method achieves <1% precision in measuring lexical diversity in article titles.
Abstract
While the modern science is characterized by an exponential growth in scientific literature, the increase in publication volume clearly does not reflect the expansion of the cognitive boundaries of science. Nevertheless, most of the metrics for assessing the vitality of science or for making funding and policy decisions are based on productivity. Similarly, the increasing level of knowledge production by large science teams, whose results often enjoy greater visibility, does not necessarily mean that "big science" leads to cognitive expansion. Here we present a novel, big-data method to quantify the extents of cognitive domains of different bodies of scientific literature independently from publication volume, and apply it to 20 million articles published over 60-130 years in physics, astronomy, and biomedicine. The method is based on the lexical diversity of titles of fixed quotas of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
