The Compute Divide in Machine Learning: A Threat to Academic Contribution and Scrutiny?
Tamay Besiroglu, Sage Andrus Bergerson, Amelia Michael, Lennart Heim,, Xueyun Luo, Neil Thompson

TL;DR
This paper investigates how disparities in computing resources between industry and academia influence AI research, potentially limiting academic contributions, scrutiny, and the diffusion of advanced models, and proposes solutions to mitigate these effects.
Contribution
It provides a data-driven analysis of the compute divide's impact on academic research focus, scrutiny, and the shift towards industry-developed models, and suggests policy recommendations.
Findings
Academic research is less involved in compute-intensive topics like foundation models.
There is a shift towards open source, industry-developed pre-trained models in academia.
Reduced academic scrutiny may impact the evaluation and safety of influential models.
Abstract
There are pronounced differences in the extent to which industrial and academic AI labs use computing resources. We provide a data-driven survey of the role of the compute divide in shaping machine learning research. We show that a compute divide has coincided with a reduced representation of academic-only research teams in compute intensive research topics, especially foundation models. We argue that, academia will likely play a smaller role in advancing the associated techniques, providing critical evaluation and scrutiny, and in the diffusion of such models. Concurrent with this change in research focus, there is a noticeable shift in academic research towards embracing open source, pre-trained models developed within the industry. To address the challenges arising from this trend, especially reduced scrutiny of influential models, we recommend approaches aimed at thoughtfully…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Scientific Computing and Data Management · Big Data and Business Intelligence
MethodsDiffusion
