Viziometrics: Analyzing Visual Information in the Scientific Literature
Po-shen Lee, Jevin D. West, and Bill Howe

TL;DR
This paper introduces viziometrics, a new field analyzing visual content in scientific papers, revealing correlations between visual information use and scientific impact, and providing tools for visual data exploration.
Contribution
It classifies over 8 million figures in scientific literature into types using computer vision, establishing a foundation for analyzing visual information's role in research impact.
Findings
Higher impact papers include more diagrams.
Visual content distribution is consistent over time but varies across fields.
A visual browser tool was developed for analysis and exploration.
Abstract
Scientific results are communicated visually in the literature through diagrams, visualizations, and photographs. These information-dense objects have been largely ignored in bibliometrics and scientometrics studies when compared to citations and text. In this paper, we use techniques from computer vision and machine learning to classify more than 8 million figures from PubMed into 5 figure types and study the resulting patterns of visual information as they relate to impact. We find that the distribution of figures and figure types in the literature has remained relatively constant over time, but can vary widely across field and topic. Remarkably, we find a significant correlation between scientific impact and the use of visual information, where higher impact papers tend to include more diagrams, and to a lesser extent more plots and photographs. To explore these results and other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
