How Big Are Peoples' Computer Files? File Size Distributions Among User-managed Collections
Jesse David Dinneen, Ba Xuan Nguyen

TL;DR
This study analyzes the distribution of file sizes in personal digital collections across different demographics and operating systems, revealing significant growth in average file size and implications for system design.
Contribution
It provides the first recent comprehensive analysis of personal file size distributions across various user groups and operating systems since 2013.
Findings
Average file size has increased over tenfold since the mid-2000s.
Most files remain under 8 MB in size.
Demographic and technological factors influence file size distributions.
Abstract
Improving file management interfaces and optimising system performance requires current data about users' digital collections and particularly about the file size distributions of such collections. However, prior works have examined only the sizes of system files and users' work files in varied contexts, and there has been no such study since 2013; it therefore remains unclear how today's file sizes are distributed, particularly personal files, and further if distributions differ among the major operating systems or common occupations. Here we examine such differences among 49 million files in 348 user collections. We find that the average file size has grown more than ten-fold since the mid-2000s, though most files are still under 8 MB, and that there are demographic and technological influences in the size distributions. We discuss the implications for user interfaces, system…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
