DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs (Extended)
Cheng Li, Abdul Dakkak, Jinjun Xiong, Wen-mei Hwu

TL;DR
DLBricks introduces a composable benchmark generation approach that simplifies and accelerates benchmarking of deep learning models on CPUs by decomposing models into reusable layers, enabling up-to-date and proprietary model evaluation.
Contribution
It proposes a novel method to automatically generate representative benchmarks from DL layers, reducing effort and time in benchmarking diverse and evolving models on CPUs.
Findings
Achieves up to 4.4x faster benchmarking time.
Provides 95% accuracy in performance estimation.
Supports proprietary models within benchmark suites.
Abstract
The past few years have seen a surge of applying Deep Learning (DL) models for a wide array of tasks such as image classification, object detection, machine translation, etc. While DL models provide an opportunity to solve otherwise intractable tasks, their adoption relies on them being optimized to meet latency and resource requirements. Benchmarking is a key step in this process but has been hampered in part due to the lack of representative and up-to-date benchmarking suites. This is exacerbated by the fast-evolving pace of DL models. This paper proposes DLBricks, a composable benchmark generation design that reduces the effort of developing, maintaining, and running DL benchmarks on CPUs. DLBricks decomposes DL models into a set of unique runnable networks and constructs the original model's performance using the performance of the generated benchmarks. DLBricks leverages two key…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
