AIBench: An Agile Domain-specific Benchmarking Methodology and an AI Benchmark Suite
Wanling Gao, Fei Tang, Jianfeng Zhan, Chuanxin Lan, Chunjie Luo, Lei, Wang, Jiahui Dai, Zheng Cao, Xiongwang Xiong, Zihan Jiang, Tianshu Hao, Fanda, Fan, Xu Wen, Fan Zhang, Yunyou Huang, Jianan Chen, Mengjia Du, Rui Ren, Chen, Zheng, Daoyi Zheng, Haoning Tang, Kunlin Zhan

TL;DR
This paper introduces AIBench, an agile, domain-specific benchmarking methodology and suite that addresses the challenges of benchmarking modern AI and internet service workloads, providing industry-relevant, end-to-end benchmarks.
Contribution
It proposes a novel agile benchmarking methodology, identifies key application scenarios, and develops a flexible, extensible AI benchmark suite for industry and research use.
Findings
AIBench effectively benchmarks AI and internet services.
It outperforms MLPerf and TailBench in relevant metrics.
The benchmark suite is publicly available for community use.
Abstract
Domain-specific software and hardware co-design is encouraging as it is much easier to achieve efficiency for fewer tasks. Agile domain-specific benchmarking speeds up the process as it provides not only relevant design inputs but also relevant metrics, and tools. Unfortunately, modern workloads like Big data, AI, and Internet services dwarf the traditional one in terms of code size, deployment scale, and execution path, and hence raise serious benchmarking challenges. This paper proposes an agile domain-specific benchmarking methodology. Together with seventeen industry partners, we identify ten important end-to-end application scenarios, among which sixteen representative AI tasks are distilled as the AI component benchmarks. We propose the permutations of essential AI and non-AI component benchmarks as end-to-end benchmarks. An end-to-end benchmark is a distillation of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Software System Performance and Reliability · Ferroelectric and Negative Capacitance Devices
