TL;DR
Fathom provides a collection of eight representative deep learning workloads to analyze their performance characteristics, aiding hardware and software design for more flexible and efficient deep learning systems.
Contribution
This paper introduces Fathom, a benchmark suite of diverse deep learning models, and analyzes their performance to inform hardware and software optimization.
Findings
Identifies key performance bottlenecks in deep learning workloads
Highlights similarities and differences in workload behavior
Provides insights on parallelism effects on scaling
Abstract
Deep learning has been popularized by its recent successes on challenging artificial intelligence problems. One of the reasons for its dominance is also an ongoing challenge: the need for immense amounts of computational power. Hardware architects have responded by proposing a wide array of promising ideas, but to date, the majority of the work has focused on specific algorithms in somewhat narrow application domains. While their specificity does not diminish these approaches, there is a clear need for more flexible solutions. We believe the first step is to examine the characteristics of cutting edge models from across the deep learning community. Consequently, we have assembled Fathom: a collection of eight archetypal deep learning workloads for study. Each of these models comes from a seminal work in the deep learning community, ranging from the familiar deep convolutional neural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
