$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
Apoorv Khandelwal, Tian Yun, Nihal V. Nayak, Jack Merullo, Stephen H. Bach, Chen Sun, Ellie Pavlick

TL;DR
This paper investigates the feasibility of academic researchers pre-training large models with limited resources, introduces a benchmark for measuring pre-training time, and demonstrates that significant reductions in GPU-days are achievable.
Contribution
It provides empirical data and a benchmark to assess pre-training feasibility for academics, challenging assumptions about resource limitations.
Findings
Replicating Pythia-1B is possible in 3x fewer GPU-days.
A benchmark for measuring pre-training time on academic GPUs is introduced.
Cost-benefit analysis clarifies trade-offs between price and training time.
Abstract
Pre-training is notoriously compute-intensive and academic researchers are notoriously under-resourced. It is, therefore, commonly assumed that academics can't pre-train models. In this paper, we seek to clarify this assumption. We first survey academic researchers to learn about their available compute and then empirically measure the time to replicate models on such resources. We introduce a benchmark to measure the time to pre-train models on given GPUs and also identify ideal settings for maximizing training speed. We run our benchmark on a range of models and academic GPUs, spending 2,000 GPU-hours on our experiments. Our results reveal a brighter picture for academic pre-training: for example, although Pythia-1B was originally trained on 64 GPUs for 3 days, we find it is also possible to replicate this model (with the same hyper-parameters) in 3x fewer GPU-days: i.e. on 4 GPUs in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHigher Education Learning Practices
Methods[[QBBO] USA]] Important)) 11 Ways To Contact Someone at QuickBooks Error Support · 18 Ways to Contact How can i speak to someone at Singapore Airlines: A Step by Step Guide
