AIBench: An Agile Domain-specific Benchmarking Methodology and an AI   Benchmark Suite

Wanling Gao; Fei Tang; Jianfeng Zhan; Chuanxin Lan; Chunjie Luo; Lei; Wang; Jiahui Dai; Zheng Cao; Xiongwang Xiong; Zihan Jiang; Tianshu Hao; Fanda; Fan; Xu Wen; Fan Zhang; Yunyou Huang; Jianan Chen; Mengjia Du; Rui Ren; Chen; Zheng; Daoyi Zheng; Haoning Tang; Kunlin Zhan; Biao Wang; Defei Kong; Minghe; Yu; Chongkang Tan; Huan Li; Xinhui Tian; Yatao Li; Gang Lu; Junchao Shao,; Zhenyu Wang; Xiaoyu Wang; Hainan Ye

arXiv:2002.07162·cs.PF·February 19, 2020·1 cites

AIBench: An Agile Domain-specific Benchmarking Methodology and an AI Benchmark Suite

Wanling Gao, Fei Tang, Jianfeng Zhan, Chuanxin Lan, Chunjie Luo, Lei, Wang, Jiahui Dai, Zheng Cao, Xiongwang Xiong, Zihan Jiang, Tianshu Hao, Fanda, Fan, Xu Wen, Fan Zhang, Yunyou Huang, Jianan Chen, Mengjia Du, Rui Ren, Chen, Zheng, Daoyi Zheng, Haoning Tang, Kunlin Zhan

PDF

Open Access

TL;DR

This paper introduces AIBench, an agile, domain-specific benchmarking methodology and suite that addresses the challenges of benchmarking modern AI and internet service workloads, providing industry-relevant, end-to-end benchmarks.

Contribution

It proposes a novel agile benchmarking methodology, identifies key application scenarios, and develops a flexible, extensible AI benchmark suite for industry and research use.

Findings

01

AIBench effectively benchmarks AI and internet services.

02

It outperforms MLPerf and TailBench in relevant metrics.

03

The benchmark suite is publicly available for community use.

Abstract

Domain-specific software and hardware co-design is encouraging as it is much easier to achieve efficiency for fewer tasks. Agile domain-specific benchmarking speeds up the process as it provides not only relevant design inputs but also relevant metrics, and tools. Unfortunately, modern workloads like Big data, AI, and Internet services dwarf the traditional one in terms of code size, deployment scale, and execution path, and hence raise serious benchmarking challenges. This paper proposes an agile domain-specific benchmarking methodology. Together with seventeen industry partners, we identify ten important end-to-end application scenarios, among which sixteen representative AI tasks are distilled as the AI component benchmarks. We propose the permutations of essential AI and non-AI component benchmarks as end-to-end benchmarks. An end-to-end benchmark is a distillation of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science · Software System Performance and Reliability · Ferroelectric and Negative Capacitance Devices