Popularity of arXiv.org within Computer Science
Charles Sutton, Linan Gong

TL;DR
This paper analyzes the increasing adoption of arXiv.org among computer scientists, revealing significant growth in preprint posting, especially in theoretical and machine learning areas, and discusses implications for peer review and research dissemination.
Contribution
It provides the first comprehensive measurement of arXiv usage in computer science, highlighting trends and variations across subfields and over time.
Findings
23% of papers on arXiv in 2017, up from 1% ten years earlier
Over 60% of papers in theoretical CS and machine learning are on arXiv
56% of arXiv preprints are posted before or during peer review
Abstract
It may seem surprising that, out of all areas of science, computer scientists have been slow to post electronic versions of papers on sites like arXiv.org. Instead, computer scientists have tended to place papers on our individual home pages, but this loses the benefits of aggregation, namely notification and browsing. But this is changing. More and more computer scientists are now using the arXiv. At the same time, there is ongoing discussion and controversy about how prepublication affects peer review, especially for double-blind conferences. This discussion is often carried out with precious little evidence of how popular prepublication is. We measure what percentage of papers in computer science are placed on the arXiv, by cross-referencing published papers in DBLP with e-prints on arXiv. We found: * Usage of arXiv.org has risen dramatically among the most selective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Scientific Computing and Data Management · Research Data Management Practices
