Automating the Horae: Boundary-work in the age of computers
Luis Reyes-Galindo

TL;DR
This paper examines how arXiv's automated filtering systems efficiently manage large-scale submissions, highlighting the benefits and challenges of algorithmic boundary-setting in open access repositories.
Contribution
It provides a detailed case study of arXiv's transformation through sophisticated filtering algorithms and discusses the social implications of automated boundary-work.
Findings
Automated filtering significantly reduces human workload.
Filtering can inadvertently exclude some authors.
arXiv balances stability and innovation through boundary management.
Abstract
This paper describes the intense software filtering that has allowed the arXiv eprint repository to sort and process large numbers of submissions with minimal human intervention, making it one of the most important and influential cases of open access repositories to date. The paper narrates arXiv's transformation, using sophisticated sorting-filtering algorithms to decrease human workload, from a small mailing list used by a few hundred researchers to a site that processes thousands of papers per month. However there are significant negative consequences for authors who have been filtered out of the main categories. There is thus a continued need to check and balance arXiv's boundaries, based in the essential tension between stability and innovation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
